Abstract. We prove $\mathrm{L}^p$ estimates for a class of two-dimensional multilinear forms
that naturally generalize (dyadic variants of) both classical paraproducts and the
twisted paraproduct introduced in [5] and studied in [1] and [6]. The method we use
builds on the approach from [6] and we present it as a rather general technique for
proving estimates on dyadic multilinear operators. In the particular application to
"generalized paraproducts" this method is combined with combinatorics of integer
partitions.



1. Introduction and formulation of the result 

Paraproducts first appeared in the work of Pommerenke [11], who showed that
$h' = f'g$ implies $\|h\|_{\mathrm{H}^2} \le C \|f\|_{\mathrm{BMOA}} \|g\|_{\mathrm{H}^2}$ for analytic functions on the unit disk.
They were named and used extensively by Bony [3] in the context of his theory of
paradifferential operators. Since then, many variants have been studied and they have
proven to be a useful concept in various mathematical disciplines. We choose the
formulation appearing in [15]. A multilinear form is called a (classical) model paraproduct
if it is given by

$$\Lambda_{\mathrm{1DP}}(f_1,\ldots,f_n) = \sum_{I\in\mathcal{D}} |I|^{1-\frac{n}{2}} \prod_{i=1}^{n} \langle f_i, \varphi_I^{(i)}\rangle = \sum_{I\in\mathcal{D}} |I|^{1-\frac{n}{2}} \int_{\mathbb{R}^n} \prod_{i=1}^{n} f_i(x_i)\,\varphi_I^{(i)}(x_i)\; dx_1\cdots dx_n,$$

where $\mathcal{D}$ is the collection of dyadic intervals and each $\varphi_I^{(i)}$ is a smooth bump function
adapted¹ to the interval $I$. We also assume that for each $I$ at least two of the
functions $\varphi_I^{(1)},\ldots,\varphi_I^{(n)}$ have mean zero. Classical Calderón-Zygmund theory establishes the
estimates



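$$|\Lambda_{\mathrm{1DP}}(f_1,\ldots,f_n)| \lesssim_{n,(p_i)} \prod_{i=1}^{n} \|f_i\|_{\mathrm{L}^{p_i}(\mathbb{R})}$$

for the exponents satisfying $\sum_{i=1}^{n} \frac{1}{p_i} = 1$ and $1 < p_i < \infty$; see [15]. One can consult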

[13] for applications and [2] , [T] for more recent extensions and references. 
1 Consult [15] for rigorous definitions of the notions bump function and adapted. This paper deals
with dyadic models only.

2 For two nonnegative quantities $A$ and $B$, we write $A \lesssim_P B$ if $A \le C_P B$ holds for some constant
$C_P > 0$ depending on a set of parameters $P$.
Significant conceptual complications that do not seem to have been extensively
studied arise as soon as one proceeds to higher dimensions. For expositional and notational
simplicity we will only work with functions on $\mathbb{R}^2$. Furthermore, our method
relies heavily on the dyadic structure, so it is appropriate to replace each smooth bump
function $\varphi_I^{(i)}$ either by the Haar scaling function $\varphi_I^{\mathrm{D}}$ or by the Haar wavelet $\psi_I^{\mathrm{D}}$. The
latter two are defined simply as

$$\varphi_I^{\mathrm{D}} := |I|^{-1/2}\, \mathbf{1}_I \quad\text{and}\quad \psi_I^{\mathrm{D}} := |I|^{-1/2} \big(\mathbf{1}_{I_{\mathrm{left}}} - \mathbf{1}_{I_{\mathrm{right}}}\big).$$

Here, $I_{\mathrm{left}}$ and $I_{\mathrm{right}}$ respectively denote the left and right halves of a dyadic interval $I$ and
we use the notation $\mathbf{1}_A$ for the characteristic function of a set $A \subseteq \mathbb{R}$. Note that the
function $\psi_I^{\mathrm{D}}$ has mean zero.

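An obvious two-dimensional analogue of the dyadic model paraproduct is

$$\Lambda_{\mathrm{2DP}}(F_1,\ldots,F_n) = \sum_{I\times J\in\mathcal{C}} |I|^{2-n} \Big(\prod_{i\in S} \langle F_i, \psi_I^{\mathrm{D}}\otimes\varphi_J^{\mathrm{D}}\rangle\Big) \Big(\prod_{i\in S^c} \langle F_i, \varphi_I^{\mathrm{D}}\otimes\varphi_J^{\mathrm{D}}\rangle\Big)$$

$$= \sum_{I\times J\in\mathcal{C}} |I|^{2-n} \int_{\mathbb{R}^{2n}} \Big(\prod_{i=1}^{n} F_i(x_i,y_i)\,\varphi_J^{\mathrm{D}}(y_i)\Big) \Big(\prod_{i\in S} \psi_I^{\mathrm{D}}(x_i)\Big) \Big(\prod_{i\in S^c} \varphi_I^{\mathrm{D}}(x_i)\Big)\, dx_1\, dy_1 \cdots dx_n\, dy_n, \tag{1.1}$$

where $\mathcal{C}$ denotes the collection of all dyadic squares in $\mathbb{R}^2$,

$$\mathcal{C} := \big\{ [2^k l_1, 2^k(l_1+1)) \times [2^k l_2, 2^k(l_2+1)) \,:\, k, l_1, l_2 \in \mathbb{Z} \big\},$$

and $S \subseteq \{1,2,\ldots,n\}$, $|S| \ge 2$, $S^c := \{1,2,\ldots,n\} \setminus S$. This object still falls into the
realm of Calderón-Zygmund theory.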

A more interesting analogue was proposed by Demeter and Thiele in [5], while
investigating the two-dimensional bilinear Hilbert transform. They raised the question of
proving $\mathrm{L}^p$ estimates for

$$\Lambda_{\mathrm{TP}}(F,G,H) = \int_{\mathbb{R}^2} \sum_{k\in\mathbb{Z}} 2^{2k} \Big(\int_{\mathbb{R}} F(x-s,y)\,\varphi(2^k s)\,ds\Big) \Big(\int_{\mathbb{R}} G(x,y-t)\,\psi(2^k t)\,dt\Big) H(x,y)\, dx\, dy,$$

where $\varphi, \psi$ are smooth bump functions and $\mathrm{supp}(\hat\psi) \subseteq \{\xi : \frac12 \le |\xi| \le 2\}$. After
a conditional result by Bernicot [1], the first $\mathrm{L}^p$ bounds for this multilinear form were
established in [6]. The form is called the twisted paraproduct and its dyadic variant can
be written as

$$\Lambda_{\mathrm{DTP}}(F,G,H) = \sum_{I\times J\in\mathcal{C}} \int_{\mathbb{R}^4} F(x_2,y_1)\,G(x_1,y_2)\,H(x_1,y_1)\, \psi_I^{\mathrm{D}}(x_1)\,\psi_I^{\mathrm{D}}(x_2)\,\varphi_J^{\mathrm{D}}(y_1)\,\varphi_J^{\mathrm{D}}(y_2)\; dx_1\, dx_2\, dy_1\, dy_2. \tag{1.2}$$

Indeed, the approach in [6] was to control the dyadic version first and then relate the
continuous version to the dyadic one.

In this paper we define a class of two-dimensional multilinear forms that naturally
generalize both classical paraproducts (1.1) and the twisted paraproduct (1.2). Then we
prove $\mathrm{L}^p$ estimates in a certain range of exponents depending on the structure of the
particular multilinear form. It turns out to be convenient to associate the new paraproduct-type
forms to finite bipartite undirected graphs. This already indicates a certain amount of
combinatorial reasoning appearing in the proof.

Let $m, n$ be positive integers and choose

$$E \subseteq \{1,\ldots,m\}\times\{1,\ldots,n\}, \quad S \subseteq \{1,\ldots,m\}, \quad T \subseteq \{1,\ldots,n\}.$$

It will be convenient to represent $E$ as the set of edges of a simple bipartite undirected
graph with vertices $\{x_1,\ldots,x_m\}$ and $\{y_1,\ldots,y_n\}$, where $x_i$ and $y_j$ are connected by
an edge if and only if $(i,j) \in E$. Also, we regard elements of $S$ and $T$ as "selected"
vertices from these two vertex-sets respectively. We additionally require $|S| \ge 2$ or
$|T| \ge 2$.




Figure 1. Graph interpretation of a triple (E,S,T). Selected vertices 
are circled. 

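To each such triple $(E,S,T)$ we associate a multilinear form $\Lambda = \Lambda_{E,S,T}$ acting on
$|E|$ functions by

$$\Lambda\big((F_{i,j})_{(i,j)\in E}\big) := \sum_{I\times J\in\mathcal{C}} |I|^{2-\frac{m+n}{2}} \int_{\mathbb{R}^{m+n}} \prod_{(i,j)\in E} F_{i,j}(x_i,y_j)\, \Big(\prod_{i\in S}\psi_I^{\mathrm{D}}(x_i)\Big) \Big(\prod_{i\in S^c}\varphi_I^{\mathrm{D}}(x_i)\Big) \Big(\prod_{j\in T}\psi_J^{\mathrm{D}}(y_j)\Big) \Big(\prod_{j\in T^c}\varphi_J^{\mathrm{D}}(y_j)\Big)\; dx_1\ldots dx_m\, dy_1\ldots dy_n.$$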

To make sure that the above summand is well-defined for each $I\times J\in\mathcal{C}$ and that all of
the succeeding arguments are valid, we suppose that $F_{i,j}$ are measurable, bounded, and
compactly supported functions. Since this is only a qualitative assumption, any
quantitative bounds can be extended by density arguments. The normalization $|I|^{2-(m+n)/2}$
is chosen so that $\Lambda$ is invariant under simultaneous dyadic dilations:

$$\Lambda\big((\mathrm{D}_{2^l}F_{i,j})_{(i,j)\in E}\big) = 2^{2l}\,\Lambda\big((F_{i,j})_{(i,j)\in E}\big),$$

where $l\in\mathbb{Z}$ and $(\mathrm{D}_{2^l}F)(x,y) := F(2^{-l}x, 2^{-l}y)$.

Informally, we say that the functions appear in a certain "twisted", "entwined", or 
"entangled" way in the definition of A. One can describe the structure of A in words: 

(1) Every edge $(x_i,y_j)$ contributes with a function $F_{i,j}$.

(2) Each vertex, $x_i$ or $y_j$, carries a "dyadic bump function" (either $\varphi^{\mathrm{D}}$ or $\psi^{\mathrm{D}}$).

(3) Selected vertices carry "dyadic bump functions" of mean zero (i.e. $\psi^{\mathrm{D}}$).

(4) At least one bipartition class, $\{x_1,\ldots,x_m\}$ or $\{y_1,\ldots,y_n\}$, contains at least two
selected vertices.

The last condition is an analogue of the standard cancellation condition for classical
paraproducts.

Now we state a boundedness result for these forms. The bipartite graph determined
by $E$ splits into connected components, i.e. maximal connected subgraphs. Let $d_{i,j}$
denote the larger of the sizes of the two bipartition classes of the connected component containing
the edge $(x_i,y_j)$.

Theorem 1.1. Let $(E,S,T)$ and $(d_{i,j})_{(i,j)\in E}$ be as above. The series $\sum_{I\times J\in\mathcal{C}}$ defining
$\Lambda$ converges absolutely and the form $\Lambda$ satisfies the estimate

$$\big|\Lambda\big((F_{i,j})_{(i,j)\in E}\big)\big| \lesssim_{m,n,(p_{i,j})} \prod_{(i,j)\in E} \|F_{i,j}\|_{\mathrm{L}^{p_{i,j}}(\mathbb{R}^2)} \tag{1.3}$$

whenever the exponents $(p_{i,j})_{(i,j)\in E}$ are such that $\sum_{(i,j)\in E} \frac{1}{p_{i,j}} = 1$ and $d_{i,j} < p_{i,j} < \infty$
for each $(i,j)\in E$.

Let us comment on a couple of already familiar particular instances.
Classical paraproducts. $m = n$, $E = \{(i,i) : i\in\{1,\ldots,n\}\}$, $|S| \ge 2$, $T = \emptyset$, $d_{i,i} = 1$.

Figure 2. Bipartite graph corresponding to a classical paraproduct.

This special case leads to ordinary two-dimensional dyadic paraproducts (1.1) and
Theorem 1.1 yields the inequality

$$|\Lambda_{\mathrm{2DP}}(F_1,\ldots,F_n)| \lesssim_{n,(p_i)} \prod_{i=1}^{n} \|F_i\|_{\mathrm{L}^{p_i}(\mathbb{R}^2)}$$

for $\sum_{i=1}^{n} \frac{1}{p_i} = 1$, $1 < p_i < \infty$.

Twisted paraproduct. $m = n = 2$, $E = \{(1,1), (1,2), (2,1)\}$, $S = \{1,2\}$, $T = \emptyset$,
$d_{1,1} = d_{1,2} = d_{2,1} = 2$.

Figure 3. Bipartite graph corresponding to the twisted paraproduct.

This case is exactly the dyadic variant of the twisted paraproduct (1.2) and Theorem
1.1 claims the estimate

$$|\Lambda_{\mathrm{DTP}}(F,G,H)| \lesssim_{p,q,r} \|F\|_{\mathrm{L}^p(\mathbb{R}^2)} \|G\|_{\mathrm{L}^q(\mathbb{R}^2)} \|H\|_{\mathrm{L}^r(\mathbb{R}^2)}$$

in the range $\frac1p + \frac1q + \frac1r = 1$, $2 < p, q, r < \infty$. The "fiber-wise" Calderón-Zygmund
decomposition of Bernicot [1] extends the estimate to the range $1 < p, q < \infty$, $2 < r < \infty$ and even
outside the Banach triangle.

Since our method is naturally adjusted to dyadic scales, we confine ourselves to
bounding dyadic model sums only and do not discuss their continuous analogues. Also
note that the decomposition from [1] does not apply in general, so there does not seem
to be a pre-existing result allowing an extension of the exponent range. See Section 6
for further discussion of that matter.

Sections 3 and 4 are devoted to the proof of Theorem 1.1, while Section 2 develops
the general technique we apply in the proof. To aid understanding we explain the key
steps on a concrete example in Section 5. The closing Section 6 contains remarks on
limitations of the approach and further problems.



2. Bellman functions in multilinear setting

In this section we set up the Bellman function scheme appropriate for proving certain
estimates for multilinear operators acting on two-dimensional functions. We work in
$\mathbb{R}^2$, but the material easily generalizes to higher dimensions.

The Bellman function theory in harmonic analysis was invented by Burkholder [3]
and developed in [8], [9], [10], and the subsequent papers by the same authors and their
collaborators. The main difference in our setup is that we do not insist on optimality
conditions and our approach can only be used to establish positive results about
boundedness of operators. This simplifies the theory, since otherwise the corresponding
Bellman functions in the sense of optimal control theory would necessarily have to be
"infinite-dimensional", i.e. would not depend on finitely many scalar parameters.

Each dyadic square $Q\in\mathcal{C}$ partitions into four congruent dyadic squares that are
called children of $Q$, and conversely, $Q$ is said to be their parent. Our method requires
working with structured families of dyadic squares called finite convex trees. A tree
is a collection $\mathcal{T}$ of dyadic squares such that there exists $Q_{\mathcal{T}}\in\mathcal{T}$, called the root of
$\mathcal{T}$, satisfying $Q\subseteq Q_{\mathcal{T}}$ for every $Q\in\mathcal{T}$. A tree $\mathcal{T}$ is convex if $Q_1\subseteq Q_2\subseteq Q_3$ and
$Q_1, Q_3\in\mathcal{T}$ imply $Q_2\in\mathcal{T}$. Informally, convex trees "do not skip any scales". A leaf
of $\mathcal{T}$ is a square that is not contained in $\mathcal{T}$, but its parent is. The family of leaves of
$\mathcal{T}$ will be denoted $\mathcal{L}(\mathcal{T})$. Note that leaves of any finite convex tree partition the root.

All of the following functions are assumed to be nonnegative, bounded, and
measurable. For a function $f$ on a dyadic interval $I$ we denote

$$[f(x)]_{x\in I} := \frac{1}{|I|}\int_I f(x)\,dx, \qquad \langle f(x)\rangle_{x\in I} := \frac{1}{|I|}\int_I f(x)\big(\mathbf{1}_{I_{\mathrm{left}}}(x) - \mathbf{1}_{I_{\mathrm{right}}}(x)\big)\,dx.$$

We prefer to emphasize the variable in which the average is taken, as for instance in
$[F(x,y)]_{x\in I}$ and $\langle F(x,y)\rangle_{x\in I}$ one deals with functions of more than one variable. We
simply write $[F(x,y)]_x$ and $\langle F(x,y)\rangle_x$ if the interval $I$ is a generic one or is understood.







VJEKOSLAV KOVAC 




Figure 4. A finite convex tree (left) and partition of its root into leaves (right). 



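Notational shortcuts such as

$$\langle\Phi(x,x',y)\rangle_{x,x',y} = \big\langle\big\langle\langle\Phi(x,x',y)\rangle_{x\in I}\big\rangle_{x'\in I}\big\rangle_{y\in J},$$

$$[\Phi(x_1,x_2,\ldots)]_{x_i\in I \text{ for all } i} = \big[\ldots\big[[\Phi(x_1,x_2,\ldots)]_{x_1\in I}\big]_{x_2\in I}\ldots\big]_{x_n\in I}$$

are also allowed. Finally, for a dyadic square $Q = I\times J$ and a function $F$ on it we
write

$$[F]_Q := \frac{1}{|Q|}\int_Q F(x,y)\,dx\,dy = [F(x,y)]_{x\in I,\ y\in J}.$$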


Let us now turn to (multilinear and multi-sublinear) operators we want to study. A
broad class of interesting objects can be reduced to

$$\Lambda_{\mathcal{T}}(F_1,\ldots,F_l) := \sum_{Q\in\mathcal{T}} |Q|\,\mathcal{A}_Q(F_1,\ldots,F_l), \tag{2.1}$$

where $\mathcal{T}$ is a finite convex tree of dyadic squares and $\mathcal{A} = \mathcal{A}_Q(F_1,\ldots,F_l)$ is some
(usually "scale-invariant") quantity depending on several two-dimensional functions
$F_1,\ldots,F_l$ and a square $Q\in\mathcal{T}$. Several examples are the terms appearing in the right
column of Table 1. Sometimes we want to deal with sums over infinite collections of
squares $Q$, so for this purpose we do not allow any constants to depend on $\mathcal{T}$.

Let $\mathcal{B} = \mathcal{B}_Q(F_1,\ldots,F_l)$ be another quantity depending on $F_1,\ldots,F_l$ and $Q\in\mathcal{T}$.
We define the first order difference of $\mathcal{B}$, denoted $\Box\mathcal{B} = \Box\mathcal{B}_Q(F_1,\ldots,F_l)$, as

$$\Box\mathcal{B}_{I\times J}(F_1,\ldots,F_l) := \tfrac14\,\mathcal{B}_{I_{\mathrm{left}}\times J_{\mathrm{left}}}(F_1,\ldots,F_l) + \tfrac14\,\mathcal{B}_{I_{\mathrm{left}}\times J_{\mathrm{right}}}(F_1,\ldots,F_l) + \tfrac14\,\mathcal{B}_{I_{\mathrm{right}}\times J_{\mathrm{left}}}(F_1,\ldots,F_l) + \tfrac14\,\mathcal{B}_{I_{\mathrm{right}}\times J_{\mathrm{right}}}(F_1,\ldots,F_l) - \mathcal{B}_{I\times J}(F_1,\ldots,F_l).$$

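For instance,

$$\mathcal{B}_{I\times J}(F) := \big[[F(x,y)]_{x\in I}^3\big]_{y\in J} \tag{2.2}$$

leads to

$$\Box\mathcal{B}_{I\times J}(F) = \tfrac14\big[[F(x,y)]^3_{x\in I_{\mathrm{left}}}\big]_{y\in J_{\mathrm{left}}} + \tfrac14\big[[F(x,y)]^3_{x\in I_{\mathrm{left}}}\big]_{y\in J_{\mathrm{right}}} + \tfrac14\big[[F(x,y)]^3_{x\in I_{\mathrm{right}}}\big]_{y\in J_{\mathrm{left}}} + \tfrac14\big[[F(x,y)]^3_{x\in I_{\mathrm{right}}}\big]_{y\in J_{\mathrm{right}}} - \big[[F(x,y)]^3_{x\in I}\big]_{y\in J}$$

$$= \tfrac12\big[[F(x,y)]^3_{x\in I_{\mathrm{left}}}\big]_{y\in J} + \tfrac12\big[[F(x,y)]^3_{x\in I_{\mathrm{right}}}\big]_{y\in J} - \big[[F(x,y)]^3_{x\in I}\big]_{y\in J}$$

$$= 3\big[[F(x,y)]_{x\in I}\,\langle F(x,y)\rangle^2_{x\in I}\big]_{y\in J}. \tag{2.3}$$

Above we used the obvious identities

$$[f(x)]_{x\in I} = \tfrac12\big([f(x)]_{x\in I_{\mathrm{left}}} + [f(x)]_{x\in I_{\mathrm{right}}}\big),$$

$$\langle f(x)\rangle_{x\in I} = \tfrac12\big([f(x)]_{x\in I_{\mathrm{left}}} - [f(x)]_{x\in I_{\mathrm{right}}}\big),$$

$$[f(x)]_{x\in I_{\mathrm{left}}} = [f(x)]_{x\in I} + \langle f(x)\rangle_{x\in I},$$

$$[f(x)]_{x\in I_{\mathrm{right}}} = [f(x)]_{x\in I} - \langle f(x)\rangle_{x\in I},$$

and $\tfrac12(A+B)^3 + \tfrac12(A-B)^3 - A^3 = 3AB^2$. Two more examples are provided in Table 1.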



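$\mathcal{B}$    |    $\Box\mathcal{B}$

$\big[[F(x,y)]_x^3\big]_y$    |    $3\big[[F(x,y)]_x\,\langle F(x,y)\rangle_x^2\big]_y$

$[F(x,y)]_{x,y}^2$    |    $[\langle F(x,y)\rangle_x]_y^2 + \langle[F(x,y)]_x\rangle_y^2 + \langle F(x,y)\rangle_{x,y}^2$

$\big[[F(x,y)G(x',y)]_y^2\big]_{x,x'}$    |    $\big[\langle F(x,y)G(x',y)\rangle_y^2\big]_{x,x'} + \big\langle[F(x,y)G(x',y)]_y^2\big\rangle_{x,x'} + \big\langle\langle F(x,y)G(x',y)\rangle_y^2\big\rangle_{x,x'}$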


Table 1. Sample table of first order differences. 
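The first two rows of Table 1 can be checked numerically. The following is a minimal sketch, assuming a discretization in which $F$ is sampled on a uniform grid over $I\times J$ (rows indexing $x$, columns indexing $y$); the helper names `haar`, `B_cube`, `B_square` and `first_order_difference` are hypothetical and introduced only for this illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
N = 8                               # grid points per half-interval (assumed resolution)
F = rng.random((2 * N, 2 * N))      # F[x, y] >= 0 sampled on I x J

def haar(G, axis):
    """Discrete <.>: half of (average over left half minus average over right half)."""
    left, right = np.split(G, 2, axis=axis)
    return 0.5 * (left.mean(axis=axis) - right.mean(axis=axis))

def B_cube(G):
    """B = [[G(x,y)]_x^3]_y on the square covered by the array G."""
    return (G.mean(axis=0) ** 3).mean()

def B_square(G):
    """B = [G(x,y)]_{x,y}^2 on the square covered by the array G."""
    return G.mean() ** 2

def first_order_difference(B, G):
    """Average of B over the four children minus B on the parent square."""
    n = G.shape[0] // 2
    children = [G[:n, :n], G[:n, n:], G[n:, :n], G[n:, n:]]
    return sum(B(c) for c in children) / 4 - B(G)

# Row 1 of Table 1, i.e. identity (2.3)
lhs_cube = first_order_difference(B_cube, F)
rhs_cube = 3 * (F.mean(axis=0) * haar(F, axis=0) ** 2).mean()

# Row 2 of Table 1
lhs_sq = first_order_difference(B_square, F)
rhs_sq = (haar(F, axis=0).mean() ** 2
          + haar(F.mean(axis=0), axis=0) ** 2
          + haar(haar(F, axis=0), axis=0) ** 2)

print(np.isclose(lhs_cube, rhs_cube), np.isclose(lhs_sq, rhs_sq))
```

Both comparisons agree up to floating-point rounding, since on such a grid the averages over half-intervals are exact block means.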



Let us now suppose that we have found a quantity $\mathcal{B}$ such that $|\mathcal{A}| \le \Box\mathcal{B}$, i.e. more
precisely

$$|\mathcal{A}_Q(F_1,\ldots,F_l)| \le \Box\mathcal{B}_Q(F_1,\ldots,F_l) \tag{2.4}$$

for all squares $Q\in\mathcal{T}$ and any nonnegative bounded functions $F_1,\ldots,F_l$. By fixing an
$l$-tuple of functions, applying (2.4) to an arbitrary $Q\in\mathcal{T}$, and multiplying by $|Q|$ we
get

$$|Q|\,|\mathcal{A}_Q(F_1,\ldots,F_l)| \le \sum_{\widetilde{Q} \text{ is a child of } Q} |\widetilde{Q}|\,\mathcal{B}_{\widetilde{Q}}(F_1,\ldots,F_l) - |Q|\,\mathcal{B}_Q(F_1,\ldots,F_l).$$

Summing over $Q\in\mathcal{T}$ leads to

$$|\Lambda_{\mathcal{T}}(F_1,\ldots,F_l)| \le \sum_{Q\in\mathcal{L}(\mathcal{T})} |Q|\,\mathcal{B}_Q(F_1,\ldots,F_l) - |Q_{\mathcal{T}}|\,\mathcal{B}_{Q_{\mathcal{T}}}(F_1,\ldots,F_l) \tag{2.5}$$

for $\Lambda_{\mathcal{T}}$ given by (2.1). To verify (2.5), one only has to notice that each term

$$|Q|\,\mathcal{B}_Q(F_1,\ldots,F_l) \quad\text{for } Q\in\mathcal{T}\setminus\{Q_{\mathcal{T}}\}$$

appears exactly once with a positive sign and exactly once with a negative sign and thus
all terms but those appearing in (2.5) cancel. Here is where we crucially
use the tree structure; a general collection of squares would not work.
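The cancellation behind (2.5) can be illustrated numerically. The sketch below is only an illustration under stated assumptions: the root is the unit square, $F$ is sampled on a $2^D\times 2^D$ grid, the sample quantity is $\mathcal{B}_Q = [F]_Q^2$, and the tree is generated at random (the function `random_convex_tree` and the encoding of a square as a triple `(k, i, j)` are hypothetical choices made for this example).

```python
import numpy as np

def average(F, k, i, j):
    """[F]_Q for the dyadic square Q of side 2**-k with index (i, j) inside the unit square."""
    s = F.shape[0] >> k
    return F[i * s:(i + 1) * s, j * s:(j + 1) * s].mean()

def children(k, i, j):
    return [(k + 1, 2 * i + a, 2 * j + b) for a in (0, 1) for b in (0, 1)]

def random_convex_tree(rng, depth, p=0.6):
    """Return (tree, leaves): a square enters the tree only together with its parent,
    so no scales are skipped; children left out of the tree are its leaves."""
    tree, leaves = [], []
    stack = [(0, 0, 0)]                      # root = unit square
    while stack:
        q = stack.pop()
        tree.append(q)
        for c in children(*q):
            if c[0] < depth and rng.random() < p:
                stack.append(c)
            else:
                leaves.append(c)
    return tree, leaves

def area(q):
    return 4.0 ** (-q[0])

def B(F, q):
    return average(F, *q) ** 2               # sample quantity B_Q = [F]_Q^2

def box_B(F, q):
    """First order difference of B at the square q."""
    return sum(B(F, c) for c in children(*q)) / 4 - B(F, q)

rng = np.random.default_rng(1)
depth = 5
F = rng.random((2 ** depth, 2 ** depth))
tree, leaves = random_convex_tree(rng, depth)

multi_scale = sum(area(q) * box_B(F, q) for q in tree)
two_scales = sum(area(q) * B(F, q) for q in leaves) - B(F, (0, 0, 0))
leaf_area = sum(area(q) for q in leaves)
print(np.isclose(multi_scale, two_scales), np.isclose(leaf_area, 1.0))
```

The second check reflects the remark that the leaves of a finite convex tree partition the root.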

The quantity $\mathcal{B}$ can be called the Bellman function for $\Lambda_{\mathcal{T}}$. It is certainly not unique
and other properties for $\mathcal{B}$ required in the actual problem will further narrow the choice.
The usefulness of (2.5) lies in the fact that it reduces controlling a multi-scale quantity $\Lambda_{\mathcal{T}}$ to
controlling two single-scale expressions: one on the level of the "finest scales" $\mathcal{L}(\mathcal{T})$ and
another one on the level of the "roughest scale" $Q_{\mathcal{T}}$. Furthermore, if $\mathcal{B}$ is nonnegative,
then the last term $|Q_{\mathcal{T}}|\,\mathcal{B}_{Q_{\mathcal{T}}}(F_1,\ldots,F_l)$ can be discarded.

The computation of $\Box\mathcal{B}$ might be quite tedious, even for very simple terms $\mathcal{B}$.
Furthermore, it might not be clear how to find any $\mathcal{B}$ such that $\Box\mathcal{B}$ relates to a given
term $\mathcal{A}$. Fortunately, there is a special class of terms in which both of these tasks are
rather straightforward. A paraproduct-type term is a quantity $\mathcal{A} = \mathcal{A}_{I\times J}(F_1,\ldots,F_l)$
that takes the form

$$\mathcal{A} = \Big(\ldots\Big(\Big(\ldots\big(\Phi(x_1,\ldots,x_m,y_1,\ldots,y_n)\big)_{x_1\in I}\ldots\Big)_{x_m\in I}\Big)_{y_1\in J}\ldots\Big)_{y_n\in J}, \tag{2.6}$$

where each pair of parentheses $(\cdot)$ is replaced by a pair of brackets, either $[\cdot]$ or $\langle\cdot\rangle$,
with the same subscripted variable. Here, $\Phi(x_1,\ldots,x_m,y_1,\ldots,y_n)$ depends on the
functions $F_1,\ldots,F_l$. An averaging paraproduct-type term is a paraproduct-type term
$\mathcal{B} = \mathcal{B}_{I\times J}(F_1,\ldots,F_l)$ containing only brackets of type $[\cdot]$, i.e.

$$\mathcal{B} = \big[\Phi(x_1,\ldots,x_m,y_1,\ldots,y_n)\big]_{x_1,\ldots,x_m\in I,\ y_1,\ldots,y_n\in J}. \tag{2.7}$$

Linear combinations of paraproduct-type terms are called paraproduct-type expressions.
We also regard $\mathcal{B} = 0$ to be a (trivial) averaging paraproduct-type term. Nontrivial
examples of paraproduct-type terms have already existed in the literature. For instance,
the Gowers box norm used by Shkredov in [12] can be written as

$$\|F\|_{\Box(I\times J)} = \big[F(x_1,y_1)F(x_1,y_2)F(x_2,y_1)F(x_2,y_2)\big]^{1/4}_{x_1,x_2\in I,\ y_1,y_2\in J}.$$

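Theorem 2.1. The first order difference of the averaging paraproduct-type term (2.7)
is a paraproduct-type expression given by the formula

$$\Box\mathcal{B} = \sum_{\substack{S\subseteq\{1,\ldots,m\},\ T\subseteq\{1,\ldots,n\} \\ |S|,|T| \text{ even},\ (S,T)\ne(\emptyset,\emptyset)}} \Big\langle\Big[\Phi(x_1,\ldots,x_m,y_1,\ldots,y_n)\Big]_{\substack{x_i\in I \text{ for } i\in S^c \\ y_j\in J \text{ for } j\in T^c}}\Big\rangle_{\substack{x_i\in I \text{ for } i\in S \\ y_j\in J \text{ for } j\in T}}.$$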

Before the proof, let us remark that we can extend the definition of paraproduct-type
terms to formal finite products consisting of finitely many two-dimensional functions
$F, G, H, F_1, F_2, \ldots$ in finitely many variables $x, x', y, y', x_1, x_2, \ldots$, with finitely many
inserted brackets of two types, $[\cdot]$ and $\langle\cdot\rangle$, each with a variable in its subscript, as
long as they can be transformed into the standard form (2.6) by renaming duplicate
variables. For example, (2.2) is just a shorter form of

$$\Big[[F(x,y)]_{x\in I}\,[F(x,y)]_{x\in I}\,[F(x,y)]_{x\in I}\Big]_{y\in J}$$

and can be further rewritten as

$$\big[F(x_1,y)F(x_2,y)F(x_3,y)\big]_{x_1,x_2,x_3\in I,\ y\in J}. \tag{2.8}$$

All terms in Table 1 are paraproduct-type terms, but only the terms in the left column
are averaging. The formula from Theorem 2.1 claims that $\Box\mathcal{B}$ is equal to the sum of
all non-averaging paraproduct-type terms obtained by replacing some pairs of brackets
of type $[\cdot]$ with pairs of brackets of type $\langle\cdot\rangle$ in any possible way such that:

(1) The number of replacements corresponding to variables in $I$ is even.

(2) The number of replacements corresponding to variables in $J$ is even.

(3) At least two replacements are made, i.e. the derived terms are not averaging.

In particular, if $\mathcal{B}$ contains $m$ brackets corresponding to variables in $I$ and $n$ brackets
corresponding to variables in $J$, then $\Box\mathcal{B}$ will consist of $2^{m+n-2} - 1$ (possibly repeating)
terms. For instance, in (2.8) there are 3 brackets corresponding to variables in $I$ and
1 bracket corresponding to a variable in $J$, giving us only $2^{3+1-2} - 1 = 3$ possibilities.
All 3 of these terms are equal and we arrive at (2.3).
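The counting rule and the formula of Theorem 2.1 can be tested on a toy model. This is a sketch under the assumption that $\Phi$ is constant on each product of half-intervals, so that it is encoded by an array with $m+n$ axes of length 2 (index 0 for the left half, index 1 for the right half); the function names are hypothetical.

```python
import itertools
import numpy as np

def bracket_term(Phi, selected):
    """Apply <.> along the axes in `selected` and [.] along all other axes."""
    out = Phi
    for axis in reversed(range(Phi.ndim)):   # reduce the last remaining axis each time
        if axis in selected:
            out = 0.5 * (out.take(0, axis=axis) - out.take(1, axis=axis))
        else:
            out = out.mean(axis=axis)
    return float(out)

def even_subsets(indices):
    for size in range(0, len(indices) + 1, 2):
        yield from itertools.combinations(indices, size)

def box_B(Phi, m, n):
    """First order difference of B = [Phi]: children averages minus the full average."""
    corners = [Phi[(a,) * m + (b,) * n] for a in (0, 1) for b in (0, 1)]
    return sum(corners) / 4 - Phi.mean()

def theorem_terms(Phi, m, n):
    """All terms with an even number of <.> in x, an even number in y, not all averaging."""
    terms = []
    for S in even_subsets(range(m)):
        for T in even_subsets(range(m, m + n)):
            if S or T:
                terms.append(bracket_term(Phi, set(S) | set(T)))
    return terms

m, n = 3, 2
rng = np.random.default_rng(2)
Phi = rng.random((2,) * (m + n))
terms = theorem_terms(Phi, m, n)
print(len(terms), np.isclose(box_B(Phi, m, n), sum(terms)))
```

For $m = 3$, $n = 2$ the list `terms` has $2^{3+2-2} - 1 = 7$ entries, in line with the count stated above.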

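Proof of Theorem 2.1. The claim can be rewritten (using the definition of $\Box\mathcal{B}$) as

$$\tfrac14\big[\Phi(x_1,\ldots,y_n)\big]_{\substack{x_1,\ldots,x_m\in I_{\mathrm{left}} \\ y_1,\ldots,y_n\in J_{\mathrm{left}}}} + \tfrac14\big[\Phi(x_1,\ldots,y_n)\big]_{\substack{x_1,\ldots,x_m\in I_{\mathrm{left}} \\ y_1,\ldots,y_n\in J_{\mathrm{right}}}} + \tfrac14\big[\Phi(x_1,\ldots,y_n)\big]_{\substack{x_1,\ldots,x_m\in I_{\mathrm{right}} \\ y_1,\ldots,y_n\in J_{\mathrm{left}}}} + \tfrac14\big[\Phi(x_1,\ldots,y_n)\big]_{\substack{x_1,\ldots,x_m\in I_{\mathrm{right}} \\ y_1,\ldots,y_n\in J_{\mathrm{right}}}}$$

$$= \sum_{\substack{S\subseteq\{1,\ldots,m\},\ T\subseteq\{1,\ldots,n\} \\ |S|,|T| \text{ even}}} \Big\langle\Big[\Phi(x_1,\ldots,x_m,y_1,\ldots,y_n)\Big]_{\substack{x_i\in I \text{ for } i\in S^c \\ y_j\in J \text{ for } j\in T^c}}\Big\rangle_{\substack{x_i\in I \text{ for } i\in S \\ y_j\in J \text{ for } j\in T}}. \tag{2.9}$$

For the purpose of the proof, we denote $\vartheta_I := \mathbf{1}_{I_{\mathrm{left}}} - \mathbf{1}_{I_{\mathrm{right}}}$ and immediately observe
that

$$\langle f(x)\rangle_{x\in I} = [f(x)\vartheta_I(x)]_{x\in I},$$
$$[f(x)]_{x\in I_{\mathrm{left}}} = [f(x)(1+\vartheta_I(x))]_{x\in I}, \tag{2.10}$$
$$[f(x)]_{x\in I_{\mathrm{right}}} = [f(x)(1-\vartheta_I(x))]_{x\in I}.$$

We start from an obvious algebraic identity

$$\Big(\prod_{i=1}^{m}\big(1+\alpha\vartheta_I(x_i)\big)\Big)\Big(\prod_{j=1}^{n}\big(1+\beta\vartheta_J(y_j)\big)\Big) = \sum_{\substack{S\subseteq\{1,\ldots,m\} \\ T\subseteq\{1,\ldots,n\}}} \alpha^{|S|}\beta^{|T|} \Big(\prod_{i\in S}\vartheta_I(x_i)\Big)\Big(\prod_{j\in T}\vartheta_J(y_j)\Big),$$

proved simply by multiplying out the product on the left hand side. We choose four
particular values for the parameters $\alpha$ and $\beta$,

$$(\alpha,\beta)\in\{(1,1), (1,-1), (-1,1), (-1,-1)\},$$

add up these four equalities, and divide by 4. Since $\tfrac14\sum_{\alpha,\beta=\pm1}\alpha^{|S|}\beta^{|T|}$ equals 1 when both $|S|$ and $|T|$ are even and 0 otherwise, only the terms with $|S|$, $|T|$ even survive on the right hand side. Finally, we multiply by $\Phi(x_1,\ldots,x_m,y_1,\ldots,y_n)$,
average over $x_1,\ldots,x_m\in I$, $y_1,\ldots,y_n\in J$, and use (2.10) to get the
desired identity (2.9). □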

Theorem 2.1 generalizes the "telescoping identity" from [6]. In order to control a
given paraproduct-type term $\mathcal{A}$, we first try to dominate it by "simpler" terms, where
a certain notion of "complexity" is defined depending on the numbers of different
functions/variables involved, or in a more refined way as in Section 3. If that is not
possible, then we take $\mathcal{B}$ such that $\mathcal{A}$ appears as one of the summands in $\Box\mathcal{B}$ and try
to dominate the remaining terms in $\Box\mathcal{B}$.

Let us state a part of the presented proof as a separate lemma. 



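Lemma 2.2. The following identity holds:

$$\sum_{\substack{S\subseteq\{1,\ldots,m\} \\ |S| \text{ even}}} \Big\langle\big[\Psi(x_1,x_2,\ldots,x_m)\big]_{x_i\in I \text{ for } i\in S^c}\Big\rangle_{x_i\in I \text{ for } i\in S} = \tfrac12\big[\Psi(x_1,x_2,\ldots,x_m)\big]_{x_1,x_2,\ldots,x_m\in I_{\mathrm{left}}} + \tfrac12\big[\Psi(x_1,x_2,\ldots,x_m)\big]_{x_1,x_2,\ldots,x_m\in I_{\mathrm{right}}}. \tag{2.11}$$

In particular, if $\Psi(x_1,x_2,\ldots,x_m)\ge 0$, then the sum (2.11) will also be nonnegative.
We remark that (2.11) is constructed by adding up all terms of the form

$$\Big(\ldots\Big(\big(\Psi(x_1,x_2,\ldots,x_m)\big)_{x_1\in I}\Big)_{x_2\in I}\ldots\Big)_{x_m\in I},$$

where an even number of pairs of parentheses $(\cdot)$ is replaced with pairs of brackets $\langle\cdot\rangle$
and the remaining ones are replaced with pairs of brackets $[\cdot]$.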



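Proof of Lemma 2.2. This time we use a simpler identity,

$$\prod_{i=1}^{m}\big(1+\alpha\vartheta_I(x_i)\big) = \sum_{S\subseteq\{1,\ldots,m\}} \alpha^{|S|}\prod_{i\in S}\vartheta_I(x_i),$$

with $\alpha = \pm1$. The sum in question is

$$\sum_{\substack{S\subseteq\{1,\ldots,m\} \\ |S| \text{ even}}} \Big[\Psi(x_1,x_2,\ldots,x_m)\prod_{i\in S}\vartheta_I(x_i)\Big]_{x_1,x_2,\ldots,x_m\in I}$$

$$= \tfrac12\Big[\Psi(x_1,x_2,\ldots,x_m)\prod_{i=1}^{m}\big(1+\vartheta_I(x_i)\big)\Big]_{x_1,x_2,\ldots,x_m\in I} + \tfrac12\Big[\Psi(x_1,x_2,\ldots,x_m)\prod_{i=1}^{m}\big(1-\vartheta_I(x_i)\big)\Big]_{x_1,x_2,\ldots,x_m\in I}$$

$$= \tfrac12\big[\Psi(x_1,x_2,\ldots,x_m)\big]_{x_1,x_2,\ldots,x_m\in I_{\mathrm{left}}} + \tfrac12\big[\Psi(x_1,x_2,\ldots,x_m)\big]_{x_1,x_2,\ldots,x_m\in I_{\mathrm{right}}}. \qquad\Box$$

Theorem 2.1 and Lemma 2.2 will be applied successively in Sections 3 and 4.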
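3. A SINGLE TREE ESTIMATE

The form $\Lambda = \Lambda_{E,S,T}$ from Section 1 can be rewritten using the notation from the
previous section as

$$\Lambda = \sum_{Q\in\mathcal{C}} |Q|\,\mathcal{A}_Q\big((F_{i,j})_{(i,j)\in E}\big),$$

with

$$\mathcal{A}_{I\times J}\big((F_{i,j})_{(i,j)\in E}\big) := \Big\langle\Big[\prod_{(i,j)\in E} F_{i,j}(x_i,y_j)\Big]_{\substack{x_i\in I \text{ for } i\in S^c \\ y_j\in J \text{ for } j\in T^c}}\Big\rangle_{\substack{x_i\in I \text{ for } i\in S \\ y_j\in J \text{ for } j\in T}}.$$

This formulation also justifies the factor $|I|^{2-(m+n)/2}$ in the definition. We may suppose
that all the functions $F_{i,j}$ are nonnegative, since otherwise they can be split into positive
and negative (real and imaginary) parts and one uses multilinearity of the form.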

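We need to introduce a local version of $\Lambda$, so for a finite convex tree of dyadic squares
$\mathcal{T}$ we define

$$\Lambda_{\mathcal{T}}\big((F_{i,j})_{(i,j)\in E}\big) := \sum_{Q\in\mathcal{T}} |Q|\,\big|\mathcal{A}_Q\big((F_{i,j})_{(i,j)\in E}\big)\big|.$$

Notice the absolute value in the definition, which makes $\Lambda_{\mathcal{T}}$ only multi-sublinear.
The main step towards the proof of Theorem 1.1 is an estimate for a single tree.

Proposition 3.1 (Single tree estimate). For any finite convex tree $\mathcal{T}$ with root $Q_{\mathcal{T}}$
and leaves $\mathcal{L}(\mathcal{T})$ we have

$$\Lambda_{\mathcal{T}}\big((F_{i,j})_{(i,j)\in E}\big) \lesssim_{m,n} |Q_{\mathcal{T}}| \prod_{(i,j)\in E}\ \max_{Q\in\mathcal{T}\cup\mathcal{L}(\mathcal{T})} \big[F_{i,j}^{d_{i,j}}\big]_Q^{1/d_{i,j}}. \tag{3.1}$$

Note that the implicit constant is independent of the tree $\mathcal{T}$ and the functions $F_{i,j}$.
We can allow it to depend on the graph, because for each pair $(m,n)$ there are only
finitely many choices for $(E,S,T)$. Moreover, $d_{i,j}$ are determined by the graph.

By homogeneity of Estimate (3.1) and its invariance under dyadic dilations we can
normalize the tree and the functions by $|Q_{\mathcal{T}}| = 1$ and

$$\max_{Q\in\mathcal{T}\cup\mathcal{L}(\mathcal{T})} \big[F_{i,j}^{d_{i,j}}\big]_Q^{1/d_{i,j}} = 1 \quad\text{for every } (i,j)\in E. \tag{3.2}$$

Thus, our task is to prove

$$\Lambda_{\mathcal{T}}\big((F_{i,j})_{(i,j)\in E}\big) \lesssim_{m,n} 1. \tag{3.3}$$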



3.1. Proof of Proposition 3.1 for complete bipartite graphs. Most of our effort
will be spent in this special case, when $E = \{1,\ldots,m\}\times\{1,\ldots,n\}$. In particular,
there is only one connected component and $d_{i,j} = \max\{m,n\}$. Later we will reduce
the general case to this one.

We begin with a simple estimate for functions on a single square. 






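Lemma 3.2. For nonnegative functions $G_{i,j}$ on a dyadic square $I\times J$ the following
inequality holds,

$$\Big[\prod_{\substack{1\le i\le m \\ 1\le j\le n}} G_{i,j}(x_i,y_j)\Big]_{\substack{x_1,\ldots,x_m\in I \\ y_1,\ldots,y_n\in J}} \le \prod_{\substack{1\le i\le m \\ 1\le j\le n}} \big[G_{i,j}^{\max\{m,n\}}\big]_{I\times J}^{1/\max\{m,n\}}.$$

Proof of Lemma 3.2. Because of the obvious symmetry, we can assume $m\ge n$. With
two applications of (generalized) Hölder's inequality for $n$ and $m$ functions respectively,
we estimate the left hand side as

$$\Big[\prod_{i=1}^{m}\Big[\prod_{j=1}^{n} G_{i,j}(x_i,y_j)\Big]_{x_i}\Big]_{y_1,\ldots,y_n} \le \Big[\prod_{i=1}^{m}\prod_{j=1}^{n}\big[G_{i,j}(x_i,y_j)^n\big]_{x_i}^{1/n}\Big]_{y_1,\ldots,y_n}$$

$$= \prod_{j=1}^{n}\Big[\prod_{i=1}^{m}\big[G_{i,j}(x,y)^n\big]_{x}^{1/n}\Big]_{y} \le \prod_{j=1}^{n}\prod_{i=1}^{m}\Big[\big[G_{i,j}(x,y)^n\big]_{x}^{m/n}\Big]_{y}^{1/m}.$$

By Jensen's inequality for the power function with exponent $\frac{m}{n}\ge 1$ we have

$$\big[G_{i,j}(x,y)^n\big]_{x}^{m/n} \le \big[G_{i,j}(x,y)^m\big]_{x},$$

so

$$\Big[\big[G_{i,j}(x,y)^n\big]_{x}^{m/n}\Big]_{y}^{1/m} \le \big[G_{i,j}(x,y)^m\big]_{x,y}^{1/m},$$

which completes the proof. □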
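As a sanity check of the inequality in Lemma 3.2, one can replace $I\times J$ by a finite uniform grid, on which Hölder's and Jensen's inequalities hold in the same form. The following is a minimal sketch under that assumption; the array layout `G[i, j, x, y]` and the function names are hypothetical.

```python
import numpy as np

def lhs(G):
    """[ prod_{i,j} G_{i,j}(x_i, y_j) ] averaged over all x_1..x_m and y_1..y_n."""
    m, n, N, _ = G.shape
    total = np.ones((N,) * (m + n))          # axes 0..m-1 are x_i, axes m..m+n-1 are y_j
    for i in range(m):
        for j in range(n):
            shape = [1] * (m + n)
            shape[i] = N                     # axis of x_i
            shape[m + j] = N                 # axis of y_j
            total = total * G[i, j].reshape(shape)
    return total.mean()

def rhs(G):
    """prod_{i,j} [ G_{i,j}^d ]^{1/d} with d = max(m, n)."""
    m, n = G.shape[:2]
    d = max(m, n)
    return float(np.prod([(G[i, j] ** d).mean() ** (1.0 / d)
                          for i in range(m) for j in range(n)]))

rng = np.random.default_rng(4)
m, n, N = 2, 3, 4
G = rng.random((m, n, N, N))                 # nonnegative samples of G_{i,j}(x, y)
print(lhs(G) <= rhs(G))

G_const = np.full((m, n, N, N), 0.5)         # constant functions give equality
print(np.isclose(lhs(G_const), rhs(G_const)))
```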

In the following discussion, all constructions and (implicit) constants are understood
to depend on $m$ and $n$, but not on the tree $\mathcal{T}$ or the functions $F_{i,j}$. We introduce the
notion of a selective $(m,n)$-partition as a $(2m+2n)$-tuple of integers

$$p = (a_1,\ldots,a_m;\ b_1,\ldots,b_n;\ \alpha_1,\ldots,\alpha_m;\ \beta_1,\ldots,\beta_n) \tag{3.4}$$

satisfying:

(1) $0\le\alpha_i\le a_i$ for $i = 1,\ldots,m$ and $0\le\beta_j\le b_j$ for $j = 1,\ldots,n$,

(2) $a_1+\ldots+a_m = m$ and $b_1+\ldots+b_n = n$,

(3) $\alpha_1+\ldots+\alpha_m$ and $\beta_1+\ldots+\beta_n$ are even,

(4) $\alpha_1+\ldots+\alpha_m\ne 0$ or $\beta_1+\ldots+\beta_n\ne 0$.

The set of all selective $(m,n)$-partitions will be denoted $\Omega_{m,n}$. To every $p\in\Omega_{m,n}$ we
associate a paraproduct-type term $\mathcal{A}^{(p)} = \mathcal{A}^{(p)}_{I\times J}\big((F_{i,j})_{(i,j)\in E}\big)$ by

$$\mathcal{A}^{(p)} := \Big\langle\Big[\prod_{\substack{1\le i\le m \\ 1\le\mu\le a_i}}\ \prod_{\substack{1\le j\le n \\ 1\le\nu\le b_j}} F_{i,j}\big(x_i^{(\mu)},y_j^{(\nu)}\big)\Big]_{\substack{x_i^{(\mu)}\in I \text{ for all } (i,\mu) \text{ such that } \alpha_i+1\le\mu\le a_i \\ y_j^{(\nu)}\in J \text{ for all } (j,\nu) \text{ such that } \beta_j+1\le\nu\le b_j}}\Big\rangle_{\substack{x_i^{(\mu)}\in I \text{ for all } (i,\mu) \text{ such that } 1\le\mu\le\alpha_i \\ y_j^{(\nu)}\in J \text{ for all } (j,\nu) \text{ such that } 1\le\nu\le\beta_j}}.$$

Factors with $a_i = 0$ or $b_j = 0$ do not appear in the above product, as we interpret
(sub)products over empty ranges to be identically 1. In words, we average the product
of $mn$ terms of the pattern $F_{i,j}(x_i,y_j)$ that contains precisely $a_i$ copies of $x_i$ and $b_j$
copies of $y_j$. Averages of type $\langle\cdot\rangle_{x_i}$ are taken over $\alpha_i$ of the $x_i$'s (i.e. these $x_i$'s are
"selected"), while averages of type $[\cdot]_{x_i}$ are taken over the $a_i-\alpha_i$ remaining ones. Similarly
for the $y_j$'s. For instance, to

$$p = (2,0;\ 2,0,1;\ 0,0;\ 1,0,1)\in\Omega_{2,3}$$

we associate

$$\mathcal{A}^{(p)} = \Big[\big\langle F_{1,1}(x_1,y_1)F_{1,1}(x_1,y_1')F_{1,3}(x_1,y_3)F_{1,1}(x_1',y_1)F_{1,1}(x_1',y_1')F_{1,3}(x_1',y_3)\big\rangle_{y_1,y_3}\Big]_{x_1,x_1',y_1'}.$$

BELLMAN FUNCTION TECHNIQUE FOR MULTILINEAR ESTIMATES

For $p\in\Omega_{m,n}$ given by (3.4) we define the composition type of $p$ to be the vector of
the first $m+n$ components,

$$\mathrm{comp}(p) := (a_1,\ldots,a_m;\ b_1,\ldots,b_n),$$

and the partition type of $p$ (and $\mathcal{A}^{(p)}$) to be an $(m+n)$-tuple

$$\mathrm{part}(p) := (a_1^*,\ldots,a_m^*;\ b_1^*,\ldots,b_n^*),$$

where $a_1^*,\ldots,a_m^*$ is the decreasing rearrangement³ of $a_1,\ldots,a_m$ and $b_1^*,\ldots,b_n^*$ is the
decreasing rearrangement of $b_1,\ldots,b_n$. The set of all these partition types will be
denoted $\Omega^*_{m,n}$. Note that $\Omega^*_{m,n}$ has $p_\#(m)\,p_\#(n)$ elements, where $p_\#(n)$ denotes the
number of distinct order-independent positive integer partitions of $n$, i.e. the number
of Young diagrams with $n$ boxes. Actually, we only use that the cardinalities $|\Omega_{m,n}|$
and $|\Omega^*_{m,n}|$ are finite numbers depending solely on $m, n$.

Finally, we define a strict total order relation $\prec$ on $\Omega^*_{m,n}$ simply as the restriction
of the inverse of the lexicographical order on $(m+n)$-tuples of integers.⁴ Since every
finite totally ordered set is isomorphic to an initial segment of positive integers, we
have a natural rank (i.e. order) function, $\mathrm{ord}\colon\Omega^*_{m,n}\to\{1,2,\ldots,p_\#(m)p_\#(n)\}$. Let us
simply write $\mathrm{ord}(p)$ for $\mathrm{ord}(\mathrm{part}(p))$. For example, the total order on $\Omega^*_{2,3}$ and its
rank function are

$$\underbrace{(2,0;\,3,0,0)}_{\mathrm{ord}=1}\prec\underbrace{(2,0;\,2,1,0)}_{\mathrm{ord}=2}\prec\underbrace{(2,0;\,1,1,1)}_{\mathrm{ord}=3}\prec\underbrace{(1,1;\,3,0,0)}_{\mathrm{ord}=4}\prec\underbrace{(1,1;\,2,1,0)}_{\mathrm{ord}=5}\prec\underbrace{(1,1;\,1,1,1)}_{\mathrm{ord}=6}.$$
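The rank function is easy to tabulate by machine. The sketch below enumerates $\Omega^*_{m,n}$ in the order $\prec$ (inverse lexicographical order, so the lexicographically largest tuple gets rank 1); the function names `partitions` and `partition_types` are hypothetical and serve only this illustration.

```python
def partitions(n, largest=None):
    """Yield the partitions of n as non-increasing tuples of positive parts."""
    if largest is None:
        largest = n
    if n == 0:
        yield ()
        return
    for first in range(min(n, largest), 0, -1):
        for rest in partitions(n - first, first):
            yield (first,) + rest

def partition_types(m, n):
    """List the partition types (a*; b*) so that list position k corresponds to ord = k + 1."""
    def pad(part, length):
        return part + (0,) * (length - len(part))   # fill up with zero parts
    types = [(pad(a, m), pad(b, n)) for a in partitions(m) for b in partitions(n)]
    # inverse lexicographical order on the concatenated (m+n)-tuples
    types.sort(key=lambda t: t[0] + t[1], reverse=True)
    return types

for rank, (a_star, b_star) in enumerate(partition_types(2, 3), start=1):
    print(rank, a_star, b_star)
```

For $(m,n) = (2,3)$ this lists the six partition types in the order displayed above, and `len(partition_types(m, n))` equals $p_\#(m)\,p_\#(n)$.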

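Our goal is to dominate all terms $\mathcal{A}^{(p)}$ by $\Box\mathcal{B}$ for some averaging paraproduct-type
expression $\mathcal{B} = \mathcal{B}_{I\times J}\big((F_{i,j})_{(i,j)\in E}\big)$ that is controlled in the sense

$$\max_{Q\in\mathcal{T}\cup\mathcal{L}(\mathcal{T})} \big|\mathcal{B}_Q\big((F_{i,j})_{(i,j)\in E}\big)\big| \lesssim_{m,n} 1. \tag{3.5}$$

This expression $\mathcal{B}$ will be the desired Bellman function. The goal will be achieved
by mathematical induction on $\mathrm{ord}(p)$ and for this we will need the following crucial
reduction lemma.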


3 This means: $a_1^*\ge\ldots\ge a_m^*$ and $a_1^*,\ldots,a_m^*$ is a permutation of the multiset $a_1,\ldots,a_m$.

4 Lexicographical order on partitions of a single positive integer extends the common dominance
order, which is only a partial order. Even though the latter one is already strong enough for the intended
purpose, we prefer to have a linear order to avoid invoking well-founded induction in the proof. For the
same reason we decide to order pairs of partitions totally (for two numbers $m$ and $n$), although we
will only need to compare partitions of a single number.






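Lemma 3.3. For any $p\in\Omega_{m,n}$ there exists an averaging paraproduct-type term $\mathcal{B}^{(p)} = 
\mathcal{B}^{(p)}_{I\times J}\big((F_{i,j})_{(i,j)\in E}\big)$ satisfying (3.5) and such that for any $0<\delta<1$ we have the estimate

$$|\mathcal{A}^{(p)}| \le \Box\mathcal{B}^{(p)} + C_{m,n}\,\delta^{-1}\sum_{\substack{\tilde p\in\Omega_{m,n} \\ \mathrm{ord}(\tilde p)<\mathrm{ord}(p)}} |\mathcal{A}^{(\tilde p)}| + C_{m,n}\,\delta\sum_{\substack{\tilde p\in\Omega_{m,n} \\ \mathrm{ord}(\tilde p)\ge\mathrm{ord}(p)}} |\mathcal{A}^{(\tilde p)}|$$

with some constant $C_{m,n} > 0$.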



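Proof of Lemma 3.3. We distinguish two cases depending on positions of selected
vertices, i.e. on the last $m+n$ coordinates in (3.4).

Case 1. $\alpha_i\ne 0$ for at least two indices $i\in\{1,\ldots,m\}$ or $\beta_j\ne 0$ for at least two
indices $j\in\{1,\ldots,n\}$.

By symmetry we may assume that $\alpha_1,\alpha_2\ge 1$ and $a_1\ge a_2$. In this case we simply
take $\mathcal{B}^{(p)} = 0$. Using $|\langle f(y)\rangle_y|\le[|f(y)|]_y$ and $|AB|\le\frac{1}{2\delta}A^2+\frac{\delta}{2}B^2$ we estimate:

$$|\mathcal{A}^{(p)}| \le \Big[\,\Big|\Big\langle\prod_{\substack{1\le j\le n \\ 1\le\nu\le b_j}} F_{1,j}\big(x_1^{(1)},y_j^{(\nu)}\big)\Big\rangle_{x_1^{(1)}}\Big|\ \Big|\Big\langle\prod_{\substack{1\le j\le n \\ 1\le\nu\le b_j}} F_{2,j}\big(x_2^{(1)},y_j^{(\nu)}\big)\Big\rangle_{x_2^{(1)}}\Big| \prod_{(i,\mu)\ne(1,1),(2,1)}\Big[\prod_{\substack{1\le j\le n \\ 1\le\nu\le b_j}} F_{i,j}\big(x_i^{(\mu)},y_j^{(\nu)}\big)\Big]_{x_i^{(\mu)}}\Big]_{\text{all } y_j^{(\nu)}}$$

$$\le \frac{1}{2\delta}\Big[\Big\langle\prod_{\substack{1\le j\le n \\ 1\le\nu\le b_j}} F_{1,j}\big(x_1,y_j^{(\nu)}\big)\Big\rangle_{x_1}^{2} \prod_{(i,\mu)\ne(1,1),(2,1)}\Big[\prod_{\substack{1\le j\le n \\ 1\le\nu\le b_j}} F_{i,j}\big(x_i^{(\mu)},y_j^{(\nu)}\big)\Big]_{x_i^{(\mu)}}\Big]_{\text{all } y_j^{(\nu)}} + \frac{\delta}{2}\Big[\Big\langle\prod_{\substack{1\le j\le n \\ 1\le\nu\le b_j}} F_{2,j}\big(x_2,y_j^{(\nu)}\big)\Big\rangle_{x_2}^{2} \prod_{(i,\mu)\ne(1,1),(2,1)}\Big[\prod_{\substack{1\le j\le n \\ 1\le\nu\le b_j}} F_{i,j}\big(x_i^{(\mu)},y_j^{(\nu)}\big)\Big]_{x_i^{(\mu)}}\Big]_{\text{all } y_j^{(\nu)}}$$

$$= \frac{1}{2\delta}\,\mathcal{A}^{(\tilde p)} + \frac{\delta}{2}\,\mathcal{A}^{(\bar p)}.$$

Here, $\tilde p,\bar p\in\Omega_{m,n}$ are defined as follows. If $p$ is given by a $(2m+2n)$-tuple (3.4), then
$\tilde p$ and $\bar p$ will have coordinates:

$$\tilde a_i = \begin{cases} a_1+1, & \text{for } i=1, \\ a_2-1, & \text{for } i=2, \\ a_i, & \text{for } i\ne 1,2, \end{cases} \qquad \tilde\alpha_i = \begin{cases} 2, & \text{for } i=1, \\ 0, & \text{for } i\ne 1, \end{cases}$$

$$\bar a_i = \begin{cases} a_1-1, & \text{for } i=1, \\ a_2+1, & \text{for } i=2, \\ a_i, & \text{for } i\ne 1,2, \end{cases} \qquad \bar\alpha_i = \begin{cases} 2, & \text{for } i=2, \\ 0, & \text{for } i\ne 2, \end{cases}$$

$$\tilde b_j = \bar b_j = b_j, \qquad \tilde\beta_j = \bar\beta_j = 0 \quad\text{for every } j.$$

Observe that $a_1$ appears to the left of $a_2$ in the list $a_1^*\ge\ldots\ge a_m^*$. Therefore, by
simultaneously increasing $a_1$ by 1 and decreasing $a_2$ by 1 we produce a lexicographically
larger partition of $m$, so we conclude $\mathrm{ord}(\tilde p)<\mathrm{ord}(p)$. On the other hand, both
$\mathrm{ord}(\bar p)<\mathrm{ord}(p)$ and $\mathrm{ord}(\bar p)\ge\mathrm{ord}(p)$ are possible, where in the former case we use
$\delta<\delta^{-1}$.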
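Case 2. $\alpha_i\ne 0$ for at most one index $i\in\{1,\ldots,m\}$ and $\beta_j\ne 0$ for at most one index
$j\in\{1,\ldots,n\}$.

Without loss of generality we assume $\alpha_i = 0$ for $i\ne 1$ and $\beta_j = 0$ for $j\ne 1$. Note
that $\alpha_1$ and $\beta_1$ are even and at least one of them is nonzero, say $\alpha_1\ge 2$. Since

$$|\mathcal{A}^{(p)}| \le \Big[\Big\langle\prod_{\substack{1\le j\le n \\ 1\le\nu\le b_j}} F_{1,j}\big(x_1,y_j^{(\nu)}\big)\Big\rangle_{x_1}^{2} \prod_{(i,\mu)\ne(1,1),(1,2)}\Big[\prod_{\substack{1\le j\le n \\ 1\le\nu\le b_j}} F_{i,j}\big(x_i^{(\mu)},y_j^{(\nu)}\big)\Big]_{x_i^{(\mu)}}\Big]_{\text{all } y_j^{(\nu)}},$$

we can also assume $\alpha_1 = 2$, $\beta_1 = 0$. Consider

$$\mathcal{B}^{(p)} := \Big[\prod_{\substack{1\le i\le m \\ 1\le\mu\le a_i}}\ \prod_{\substack{1\le j\le n \\ 1\le\nu\le b_j}} F_{i,j}\big(x_i^{(\mu)},y_j^{(\nu)}\big)\Big]_{\substack{x_i^{(\mu)}\in I \text{ for all } (i,\mu) \\ y_j^{(\nu)}\in J \text{ for all } (j,\nu)}}.$$

Indeed, $\mathcal{B}^{(p)}$ depends only on $q = \mathrm{comp}(p)$. Observe that Lemma 3.2 and Normalization (3.2)
guarantee Condition (3.5). Theorem 2.1 gives the equality

$$\Box\mathcal{B}^{(p)} = \sum_{\substack{p'\in\Omega_{m,n} \\ \mathrm{comp}(p')=q}} \binom{q}{p'}\,\mathcal{A}^{(p')}, \tag{3.6}$$

where

$$\binom{q}{p'} := \prod_{i=1}^{m}\binom{a_i}{\alpha_i'}\ \prod_{j=1}^{n}\binom{b_j}{\beta_j'}$$

counts the repeating terms $\mathcal{A}^{(p')}$. Note that $1\le\binom{q}{p'}\lesssim_{m,n} 1$. Let us split the summation
set

$$\Omega_{m,n,q} := \{p'\in\Omega_{m,n} : \mathrm{comp}(p') = q\}$$

into three parts,

$$\Omega^{(1)}_{m,n,q} := \{p'\in\Omega_{m,n,q} : \alpha_i'\ne 0 \text{ for exactly one } i \text{ and } \beta_j' = 0 \text{ for every } j\},$$

$$\Omega^{(2)}_{m,n,q} := \{p'\in\Omega_{m,n,q} : \beta_j'\ne 0 \text{ for exactly one } j\},$$

$$\Omega^{(3)}_{m,n,q} := \{p'\in\Omega_{m,n,q} : \alpha_i'\ne 0 \text{ for at least two } i \text{ and } \beta_j' = 0 \text{ for every } j\} \cup \{p'\in\Omega_{m,n,q} : \beta_j'\ne 0 \text{ for at least two } j\}.$$

First, observe that $p\in\Omega^{(1)}_{m,n,q}$ and that

$$\mathcal{A}^{(p')} = \Big[\Big\langle\prod_{\substack{1\le j\le n \\ 1\le\nu\le b_j}} F_{i_0,j}\big(x_{i_0},y_j^{(\nu)}\big)\Big\rangle_{x_{i_0}}^{2s} \prod_{\substack{(i,\mu) \text{ such that} \\ i\ne i_0 \text{ or } \mu>2s}}\Big[\prod_{\substack{1\le j\le n \\ 1\le\nu\le b_j}} F_{i,j}\big(x_i^{(\mu)},y_j^{(\nu)}\big)\Big]_{x_i^{(\mu)}}\Big]_{\text{all } y_j^{(\nu)}} \ge 0$$

for every $p'\in\Omega^{(1)}_{m,n,q}$, where $i_0$ is chosen such that $\alpha'_{i_0} = 2s\ne 0$. In particular,

$$0 \le \mathcal{A}^{(p)} \le \sum_{p'\in\Omega^{(1)}_{m,n,q}} \binom{q}{p'}\,\mathcal{A}^{(p')}. \tag{3.7}$$

Next, Lemma 2.2 applied to

$$\Psi = \Psi\big((x_i^{(\mu)})_{1\le i\le m,\ 1\le\mu\le a_i}\big) := \sum_{j_0=1}^{n}\ \sum_{r=1}^{\lfloor b_{j_0}/2\rfloor} \binom{b_{j_0}}{2r} \Big\langle\prod_{\substack{1\le i\le m \\ 1\le\mu\le a_i}} F_{i,j_0}\big(x_i^{(\mu)},y_{j_0}\big)\Big\rangle_{y_{j_0}}^{2r} \prod_{\substack{(j,\nu) \text{ such that} \\ j\ne j_0 \text{ or } \nu>2r}}\Big[\prod_{\substack{1\le i\le m \\ 1\le\mu\le a_i}} F_{i,j}\big(x_i^{(\mu)},y_j^{(\nu)}\big)\Big]_{y_j^{(\nu)}} \ge 0$$

(where $\beta'_{j_0} = 2r\ne 0$) yields

$$\sum_{p'\in\Omega^{(2)}_{m,n,q}} \binom{q}{p'}\,\mathcal{A}^{(p')} \ge 0. \tag{3.8}$$

Finally, each $\mathcal{A}^{(p')}$ for $p'\in\Omega^{(3)}_{m,n,q}$ can be controlled as in Case 1 to obtain

$$|\mathcal{A}^{(p')}| \le \delta^{-1}\sum_{\substack{\tilde p\in\Omega_{m,n} \\ \mathrm{ord}(\tilde p)<\mathrm{ord}(p)}} |\mathcal{A}^{(\tilde p)}| + \delta\sum_{\substack{\tilde p\in\Omega_{m,n} \\ \mathrm{ord}(\tilde p)\ge\mathrm{ord}(p)}} |\mathcal{A}^{(\tilde p)}|. \tag{3.9}$$

Combining (3.6)-(3.9) proves the stated inequality. □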

Figure |5] depicts partition types in fl^ 4 an d ways of controlling ,4( p ) as in Case 1 
of the previous proof. Different kinds of arrows represent different possibilities for 
various p £ ^2,4 with the same part(p). Labels 5 and <5 _1 represent coefficients in the 
"reduction inequality" for ,4.( p ). It is important that always at least one arrow (the one 
marked by d^ 1 ) points to a partition type with smaller rank, i.e. points downwards or 
to the right in the picture. 



[Figure 5: diagram of the partition types (1,1; 1,1,1,1), (1,1; 2,1,1,0), (1,1; 2,2,0,0), (1,1; 3,1,0,0), (1,1; 4,0,0,0), (2,0; 1,1,1,1), (2,0; 2,1,1,0), (2,0; 2,2,0,0), (2,0; 3,1,0,0), (2,0; 4,0,0,0), joined by arrows labeled $\delta$ and $\delta^{-1}$.]
Figure 5. Recursive control of partition types in $\Omega_{2,4}$.
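The labels in Figure 5 can be enumerated mechanically. The following is a minimal sketch, assuming that each label is a pair of zero-padded integer partitions (one of m, one of n) and that p#(m) denotes the number of integer partitions of m; the function names are hypothetical and are not taken from the paper.

```python
def partitions(total, max_part=None):
    """Yield the integer partitions of `total` as non-increasing tuples."""
    if max_part is None:
        max_part = total
    if total == 0:
        yield ()
        return
    for first in range(min(total, max_part), 0, -1):
        for rest in partitions(total - first, first):
            yield (first,) + rest


def padded_partitions(total):
    """Partitions of `total`, padded with zeros to length `total`."""
    return [p + (0,) * (total - len(p)) for p in partitions(total)]


def partition_types(m, n):
    """All pairs (partition of m, partition of n), assumed to label Figure 5."""
    return [(a, b) for a in padded_partitions(m) for b in padded_partitions(n)]


types = partition_types(2, 4)
for a, b in types:
    print(a, b)
```

For m = 2 and n = 4 this lists as many pairs as there are labels in Figure 5, which under the stated assumption is p#(2)p#(4).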
Lemma 3.4. There exists a "universal" averaging paraproduct-type expression $\mathcal{B}^{(m,n)} = \mathcal{B}^{(m,n)}_Q((F_{i,j})_{(i,j)\in E})$ satisfying (3.5) and

$$\sum_{\mathbf{p}\in\Omega_{m,n}} |\mathcal{A}^{(\mathbf{p})}| \le \Box\mathcal{B}^{(m,n)}.$$

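Proof of Lemma 3.4. We prove the following claim by mathematical induction on $\kappa \in \{0, 1, 2, \ldots, p\#(m)p\#(n)\}$: For every $0 < \varepsilon < 1$ there exists an averaging paraproduct-type expression $\mathcal{B}^{(\kappa,\varepsilon)}$ satisfying

$$\max_{Q\in\mathcal{T}\cup\mathcal{L}(\mathcal{T})} \big|\mathcal{B}^{(\kappa,\varepsilon)}_Q\big((F_{i,j})_{(i,j)\in E}\big)\big| \lesssim_{m,n,\varepsilon} 1$$

and

$$\sum_{\substack{\mathbf{p}\in\Omega_{m,n}\\ \mathrm{ord}(\mathbf{p})\le\kappa}} |\mathcal{A}^{(\mathbf{p})}| \le \Box\mathcal{B}^{(\kappa,\varepsilon)} + \varepsilon \sum_{\substack{\mathbf{p}\in\Omega_{m,n}\\ \mathrm{ord}(\mathbf{p})>\kappa}} |\mathcal{A}^{(\mathbf{p})}|.$$

The bound we need to prove is a special instance for $\kappa = p\#(m)p\#(n)$ and any $\varepsilon$, since then the sum on the right disappears. On the other hand, the sum on the left is 0 for $\kappa = 0$, so the inequality (trivially) holds with $\mathcal{B}^{(0,\varepsilon)} = 0$. We take $\kappa = 0$ as the induction basis.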

In the induction step we suppose that the claim holds for some $0 \le \kappa \le p\#(m)p\#(n) - 1$. To prove the claim for $\kappa + 1$ take arbitrary $0 < \varepsilon < 1$. Let $C_{m,n}$ be the constant from

Lemma [331 Induction hypothesis applied with e' = (^ — %~ — r ) 2 gives B^' 6 ^ such 
that 



Y \av\ < ubw + (^^) 2 E i^ (p) i- ( 3 - 10 ) 

pGfim,n P£^m,n 

ord(p)<K ord(p)>K 

Lemma 1331 applied to each p £ fl mn , ord(p) < re+ 1, and 6 = -r^ — %~ — , yields 



pen 

ord(p)<K+l ord(p)<K+l ord(p)<K 



(p)i 



which combined with (3.10) leads to



i^ (p) i - Yl nBiP) + "1 Dg(K,£/) + i Yl i^ (p) i- 

pGHm.n p£f!m,n pGS^m,n 

ord(p)<K+l ord(p)<K+l 

It remains to split $\sum_{\mathbf{p}\in\Omega_{m,n}}$ on the right hand side according to $\mathrm{ord}(\mathbf{p}) \le \kappa + 1$ or $\mathrm{ord}(\mathbf{p}) > \kappa + 1$, move the former summands to the left hand side, and multiply the inequality by 2. We set$^5$



— B^ £ '\ 



□ 



ord(p)<K+l 



Let us finally complete the proof of Proposition 3.1 in this special case. Even though the general term $\mathcal{A}$ of $\Lambda$ is not necessarily among $\mathcal{A}^{(\mathbf{p})}$, $\mathbf{p} \in \Omega_{m,n}$, it can still be easily dominated by two of these terms. Remember that $|S| \ge 2$ or $|T| \ge 2$, so without loss of generality suppose $|S| \ge 2$ and take $i_1, i_2 \in S$, $i_1 \neq i_2$.



\A\< 



< 



n ^ j » ) ) ( n j' > ) ) ; n i n f « > ) 

l<j<n Xil l<j<n Xi2 



yi,—,yn 



l<j<n 



1 

+ 2 



rnn^ 



Above p, p6 f2 m „ are given by their components: 



yi,—,yn 



2^ 



2 A 



a, 





for i 


= k, 




for i 


= k, 




for i 






for i 


= ii, 




for i 






Pr- 


= ^ = 



a; 



for i — ii, 
for 1 = 12, 
for i 7^ zi, i 2 , 
for i = i 2 , 
for i ^ i 2 , 



Lemma 3.4, Bound (3.5), and the general Estimate (2.5) establish (3.3).



3.2. Proof of Proposition 3.1 in full generality. Let us now consider a general triple $(E, S, T)$. Suppose that there are $k$ connected components that contain at least one edge and list their vertex-sets as $X_1 \cup Y_1, X_2 \cup Y_2, \ldots, X_k \cup Y_k$, where $X_l \subseteq \{x_1, \ldots, x_m\}$ and $Y_l \subseteq \{y_1, \ldots, y_n\}$ for $l = 1, \ldots, k$. The numbers $d_{i,j}$ were defined to be

$$d_{i,j} = d^{(l)} := \max\{|X_l|, |Y_l|\}$$

if $x_i \in X_l$, $y_j \in Y_l$ for some $l \in \{1, 2, \ldots, k\}$. Since $d_{i,j}$ depends on the component only, we sometimes prefer to write it as $d^{(l)}$. For notational convenience we also denote

$$\mathcal{X}_l := \{i \in \{1, \ldots, m\} : x_i \in X_l\}, \qquad \mathcal{Y}_l := \{j \in \{1, \ldots, n\} : y_j \in Y_l\}$$

for $l = 1, \ldots, k$.

Vertices in $\{x_1, \ldots, x_m\} \setminus (X_1 \cup \ldots \cup X_k)$ and $\{y_1, \ldots, y_n\} \setminus (Y_1 \cup \ldots \cup Y_k)$ are isolated, i.e. no edge is incident to any of them. Note that isolated vertices contribute to the



5 From this recurrence relation for $\mathcal{B}^{(\kappa,\varepsilon)}$ we see that the final Bellman function is assembled as a linear combination of familiar terms $\mathcal{B}^{(\mathbf{p})}$, but the coefficients can theoretically grow "very rapidly" and might be hard to track down explicitly in concrete examples.
form $\Lambda$ in a trivial way. If $x_i$ is a non-selected isolated vertex, then no function $F_{i,j}$ will have $x_i$ as a variable, so the contribution of $x_i$ is only in the factor

$$|I|^{-1/2} \int_{\mathbb{R}} \varphi^{\mathrm{D}}_I(x_i)\,dx_i = 1.$$

On the other hand, if $x_i$ is a selected isolated vertex, then that factor will be

$$\int_{\mathbb{R}} \psi^{\mathrm{D}}_I(x_i)\,dx_i = 0,$$

so consequently $\Lambda = 0$. The same reasoning holds for isolated vertices $y_j$. Therefore, from now on we assume that the associated bipartite graph has no isolated vertices.

By taking $F_{i,j} \equiv 1$ whenever $(i,j) \notin E$ and "completing" each component of the graph, we can assume

$$E = (\mathcal{X}_1 \times \mathcal{Y}_1) \cup \ldots \cup (\mathcal{X}_k \times \mathcal{Y}_k).$$

Note that the completed graph still has the same characteristic quantities $d_{i,j}$ and that functions $F_{i,j}$ identically equal to 1 do not contribute to either of the sides in (3.1). Connected components are useful because the associated forms $\Lambda_{\mathcal{T}}$ split as

k 

A T ((F id ) (id)eE ) := \Q\U\ A Q(^(^^)l 
QeT 1=1 



with 



•A\xj — ( J J E,j{Xi,Vj) 



Xi&I for i&S c nXt ■ 



ieXijeyi W eJ for JtTnyi yjEJ for ieT c ny ; 

We distinguish two cases with respect to the distribution of selected vertices in the graph. Recall once again that $|S| \ge 2$ or $|T| \ge 2$, which guarantees that the following two cases cover all possibilities, although not necessarily disjoint ones.

Case 1. $|S \cap \mathcal{X}_{l_0}| \ge 2$ or $|T \cap \mathcal{Y}_{l_0}| \ge 2$ for some index $l_0 \in \{1, \ldots, k\}$.

In words, some connected component contains two selected vertices in one of its bipartition classes. From the previously proven case of Proposition 3.1 applied to the complete bipartite graph with vertex-sets $X_{l_0}, Y_{l_0}$ we have the estimate

ElQlkS 0) ((^)(^ x3;J| < m ,n\Qr\ II o^m^X d{l0) = l - 

QeT iex kr jey k , 1 ' 

Applying Lemma 3.2 to the functions $(F_{i,j})_{i\in\mathcal{X}_l,\, j\in\mathcal{Y}_l}$ we get

i^((^-wx M )i<[ n FiA^yj)}^ to* < n khst^ 1 

for each $Q = I \times J \in \mathcal{T}$ and each $l \neq l_0$. This establishes (3.3).
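Case 2. $S \cap \mathcal{X}_{l_1} \neq \emptyset \neq S \cap \mathcal{X}_{l_2}$ or $T \cap \mathcal{Y}_{l_1} \neq \emptyset \neq T \cap \mathcal{Y}_{l_2}$ for two different indices $l_1, l_2 \in \{1, \ldots, k\}$.

In words, there exist two connected components each containing at least one selected x-vertex or each containing at least one selected y-vertex. Without loss of generality suppose $S \cap \mathcal{X}_1 \neq \emptyset$ and $S \cap \mathcal{X}_2 \neq \emptyset$. Using Lemma 3.2 we can estimate the general term as

$$|\mathcal{A}| = |\mathcal{A}^{(1)}||\mathcal{A}^{(2)}||\mathcal{A}^{(3)}| \cdots |\mathcal{A}^{(k)}| \lesssim |\mathcal{A}^{(1)}||\mathcal{A}^{(2)}| \le \tfrac{1}{2}(\mathcal{A}^{(1)})^2 + \tfrac{1}{2}(\mathcal{A}^{(2)})^2.$$

By symmetry it is enough to handle $(\mathcal{A}^{(1)})^2$ and by renaming vertices we may assume $\mathcal{X}_1 = \{1, \ldots, m_1\}$, $\mathcal{Y}_1 = \{1, \ldots, n_1\}$, and $1 \in S$.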

If $m_1 = n_1 = 1$, then from the second row of Table 1,

(A^) 2 = [(F x , x (x x , yx )) xl ] 2 yi or (F^ift))* 



■yi 



< [(Fi 5 i(xi, yi)) a;i ] 2 i + (F^yi)) 2 ^ + ([F ltl (x 1 ,y 1 )] Xl ) 2 yi = □( [Fi.i^i, m)} 2 Xl>yi ) . 

We separate this special case because now $d^{(1)} = 1$ (as for classical paraproducts), so the normalization (3.2) does not control averages of higher powers of $F_{1,1}$. However, the first power is enough here:


max Bq{F 1>1 ) = ( max [F 1)X 



QeTu£(T) 



V QeT u£(T) 



1. 



On the other hand, the condition $d^{(1)} \ge 2$ is ensured if $m_1 = 1$, $n_1 \ge 2$, so then we bound $(\mathcal{A}^{(1)})^2$ as

2 



YlF hj ( Xl , yj ) 

i=i 



< 



VU — >Vn 



Y[F h] (x uyj ) 



yi,-,y ni 



We can recognize the last term as $\mathcal{A}^{(\mathbf{p})}$ for $\mathbf{p} \in \Omega_{2,n_1}$ given by $a_1 = \alpha_1 = 2$, $a_2 = \alpha_2 = 0$, $b_j = 1$, $\beta_j = 0$ for $1 \le j \le n_1$. Thus, $(\mathcal{A}^{(1)})^2 \le \mathcal{A}^{(\mathbf{p})} \le \Box\mathcal{B}^{(2,n_1)}$, where $\mathcal{B}^{(2,n_1)}$ is from Lemma 3.4.

If $m_1 \ge 2$, then one can write with the help of the Cauchy-Schwarz inequality and Lemma 3.2 once again:



(^ (1) ) 2 < 



ni ni _ mi ni 

n 7 ''^-'-^),.. [n^^^L n[n F «^'%') 



< 



3=1 



3=1 

mi n-1 



Yl F hj( x i,yj)) x II [11^'^'%') 

j=l i=3 3=1 



i=3 j=l 



J Vl,—,Vn 



^(p)<DB('«i>"i) 
mi ni 



ni 2 mi ni 

[Y[F 2 ,(x 2 , yj )]^ U[HF hj (x hyj ) 

j=l i=3 j=l 



yi,—,yni 



— 

<i 
where $\mathbf{p} \in \Omega_{m_1,n_1}$ has coordinates

$$a_i = \begin{cases} 2, & \text{for } i = 1,\\ 0, & \text{for } i = 2,\\ 1, & \text{for } 3 \le i \le m_1, \end{cases} \qquad \alpha_i = \begin{cases} 2, & \text{for } i = 1,\\ 0, & \text{for } 2 \le i \le m_1, \end{cases}$$

$$b_j = 1, \quad \beta_j = 0 \quad \text{for } 1 \le j \le n_1,$$

and $\mathcal{B}^{(m_1,n_1)}$ is from Lemma 3.4.

In all three possibilities above the proof is finished by invoking Estimate (2.5).

4. Completing the proof of the main theorem 

To establish the global estimate we adapt the approach by Thiele from [14].

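Proof of the main theorem. Fix a positive integer $N$ and consider only squares with side-length at least $2^{-N}$,

$$\mathcal{C}_N := \{Q = I \times J \in \mathcal{C} : |I| = |J| \ge 2^{-N}\}.$$

We prove the bound

$$\sum_{Q\in\mathcal{C}_N} |Q|\,\big|\mathcal{A}_Q\big((F_{i,j})_{(i,j)\in E}\big)\big| \lesssim_{m,n,(p_{i,j})} \prod_{(i,j)\in E} \|F_{i,j}\|_{L^{p_{i,j}}(\mathbb{R}^2)} \qquad (4.1)$$

with the implicit constant independent of $N$, so that it implies the result for the whole collection $\mathcal{C}$. Using homogeneity, this time we normalize

$$\|F_{i,j}\|_{L^{p_{i,j}}(\mathbb{R}^2)} = 1 \quad \text{for every } (i,j) \in E.$$

For each $|E|$-tuple of integers $\mathbf{k} = (k_{i,j})_{(i,j)\in E} \in \mathbb{Z}^E$ we denote

$$\mathcal{P}_{\mathbf{k}} := \Big\{Q \in \mathcal{C}_N : 2^{k_{i,j}} \le \sup_{Q'\in\mathcal{C}_N,\, Q'\supseteq Q} \big[F_{i,j}^{d_{i,j}}\big]_{Q'}^{1/d_{i,j}} < 2^{k_{i,j}+1} \text{ for every } (i,j) \in E\Big\}.$$

Note that squares in $\mathcal{P}_{\mathbf{k}}$ satisfy $|Q| \le 2^{-p_{i,j}(k_{i,j}-1)}$ for any $(i,j) \in E$, which limits their size from above. To verify this, we take $Q \in \mathcal{P}_{\mathbf{k}}$ and choose $Q' \supseteq Q$ such that $[F_{i,j}^{d_{i,j}}]_{Q'}^{1/d_{i,j}} > 2^{k_{i,j}-1}$. By the monotonicity of normalized $L^p$ norms

$$2^{k_{i,j}-1} < \big[F_{i,j}^{d_{i,j}}\big]_{Q'}^{1/d_{i,j}} \le \big[F_{i,j}^{p_{i,j}}\big]_{Q'}^{1/p_{i,j}} = |Q'|^{-1/p_{i,j}} \|F_{i,j}\|_{L^{p_{i,j}}(Q')} \le |Q'|^{-1/p_{i,j}}$$

and thus $|Q| \le |Q'| \le 2^{-p_{i,j}(k_{i,j}-1)}$.

Define $\mathcal{M}_{\mathbf{k}}$ to be the collection of maximal squares in $\mathcal{P}_{\mathbf{k}}$ with respect to the set inclusion. For each $\widetilde{Q} \in \mathcal{M}_{\mathbf{k}}$ the family

$$\mathcal{T}_{\widetilde{Q}} := \{Q \in \mathcal{P}_{\mathbf{k}} : Q \subseteq \widetilde{Q}\}$$

is a finite convex$^6$ tree with root $\widetilde{Q}$ and for different squares $\widetilde{Q} \in \mathcal{M}_{\mathbf{k}}$ the corresponding trees cover disjoint regions in the plane. For $Q \in \mathcal{T}_{\widetilde{Q}}$ by the construction of $\mathcal{P}_{\mathbf{k}}$ we have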



6 Convexity of $\mathcal{T}_{\widetilde{Q}}$ follows from monotonicity of $Q \mapsto \sup_{Q'\in\mathcal{C}_N,\, Q'\supseteq Q} [F_{i,j}^{d_{i,j}}]_{Q'}^{1/d_{i,j}}$.
$[F_{i,j}^{d_{i,j}}]_Q^{1/d_{i,j}} < 2^{k_{i,j}+1}$. Also, if $Q \in \mathcal{L}(\mathcal{T}_{\widetilde{Q}})$ and $Q'$ is the parent of $Q$, then $[F_{i,j}^{d_{i,j}}]_Q^{1/d_{i,j}} \le 4\,[F_{i,j}^{d_{i,j}}]_{Q'}^{1/d_{i,j}} < 2^{k_{i,j}+3}$. Therefore, Proposition 3.1 gives
We decompose$^7$ and estimate,
J2\Q\\M( F Mh,j)eE)\ = Yl E A r Q ((^)(M)^) 



<m,» £ - )>: ' ' ' ( £)|Q|). (4-2) 



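For any $d \ge 1$ it is a classical result$^8$ that the "power $d$ variant" of the dyadic maximal function

$$(M_d F)(x,y) := \sup_{Q\in\mathcal{C},\, Q\ni(x,y)} \big[|F|^d\big]_Q^{1/d}$$

is bounded on $L^p(\mathbb{R}^2)$ whenever $d < p \le \infty$. We have $\mathbb{Z}^E = \bigcup_{(i_0,j_0)\in E} \mathcal{K}_{(i_0,j_0)}$, where the subsets $\mathcal{K}_{(i_0,j_0)}$ are defined by

$$\mathcal{K}_{(i_0,j_0)} := \big\{\mathbf{k} = (k_{i,j})_{(i,j)\in E} \in \mathbb{Z}^E : p_{i_0,j_0}k_{i_0,j_0} \ge p_{i,j}k_{i,j} \text{ for every } (i,j) \in E\big\}.$$

Observe that for $(x,y) \in Q \in \mathcal{P}_{\mathbf{k}}$ we have by the definition of $\mathcal{P}_{\mathbf{k}}$,

$$(M_{d_{i_0,j_0}} F_{i_0,j_0})(x,y) \ge \sup_{Q'\in\mathcal{C}_N,\, Q'\supseteq Q} \big[F_{i_0,j_0}^{d_{i_0,j_0}}\big]_{Q'}^{1/d_{i_0,j_0}} \ge 2^{k_{i_0,j_0}},$$

which together with disjointness of $\mathcal{M}_{\mathbf{k}}$ yields

$$\sum_{Q\in\mathcal{M}_{\mathbf{k}}} |Q| = \Big|\bigcup_{Q\in\mathcal{M}_{\mathbf{k}}} Q\Big| \le \big|\{(x,y) \in \mathbb{R}^2 : (M_{d_{i_0,j_0}} F_{i_0,j_0})(x,y) \ge 2^{k_{i_0,j_0}}\}\big| \qquad (4.3)$$

for any choice of $(i_0, j_0) \in E$.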

Combining (4.2) and (4.3) allows the final computation:

E^I|^(^)W)^)I E E 2E(«) 6 -*m |{M rfi0]i F i0)io >2 fe -o}| 



^ £ 2 W^o I {M di0ij F JoJO >2 fc >o,o} | J] £ 2 



P »Q,jO fc »0.jQ 



^1 E E 2^o,o^o.o |{M d!oj F JoJO > 2^.0} | 

< \ 1 1 tv /r rp llP'OJO < \ r II p hP'o^o <i i 1 

r^{Pi,j) / j II 1Vld i ,30 ? OJ'oIIl p *OJO(R2) ), (pi j ) / 1 1 C «0 JO 1 1 \fi , j (R2 ) 5 



(«o,io)G-B (io,io)e-B 



7 Here we use that for each $Q \in \mathcal{C}_N \setminus \bigcup_{\mathbf{k}\in\mathbb{Z}^E} \mathcal{P}_{\mathbf{k}}$ at least one of the functions constantly vanishes on $Q$, so the corresponding summand is equal to 0.

8 One simply writes $M_d F = (M_1 |F|^d)^{1/d}$, where $M_1$ is the standard dyadic maximal function.
which is exactly (4.1). We used $\sum_{(i,j)\in E} \frac{1}{p_{i,j}} = 1$ and added up $|E| - 1$ geometric series with initial terms in $(\frac{1}{2}, 1]$ and common ratios equal to $\frac{1}{2}$.



□ 
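The identity $M_d F = (M_1 |F|^d)^{1/d}$ from the footnote in the proof above can be illustrated on a finite grid. This is only a minimal sketch under simplifying assumptions that are not in the paper: the function is sampled on a $2^N \times 2^N$ array of cells, and the supremum runs only over dyadic squares contained in that array. The names `block_average` and `dyadic_maximal` are hypothetical.

```python
def block_average(grid, size):
    """Average `grid` over each aligned size-by-size block, written back to every cell."""
    n = len(grid)
    out = [[0.0] * n for _ in range(n)]
    for r0 in range(0, n, size):
        for c0 in range(0, n, size):
            total = sum(grid[r][c]
                        for r in range(r0, r0 + size)
                        for c in range(c0, c0 + size))
            avg = total / (size * size)
            for r in range(r0, r0 + size):
                for c in range(c0, c0 + size):
                    out[r][c] = avg
    return out


def dyadic_maximal(grid, d=1.0):
    """Power-d dyadic maximal function, computed as (M_1 |F|^d)^(1/d).

    Assumes len(grid) is a power of two and only uses dyadic squares
    inside the grid (a localized version of the operator in the text).
    """
    n = len(grid)
    powered = [[abs(v) ** d for v in row] for row in grid]
    best = [[0.0] * n for _ in range(n)]
    size = 1
    while size <= n:
        avg = block_average(powered, size)
        for r in range(n):
            for c in range(n):
                best[r][c] = max(best[r][c], avg[r][c])
        size *= 2
    return [[v ** (1.0 / d) for v in row] for row in best]


F = [[1, 0, 0, 0],
     [0, 0, 0, 0],
     [0, 0, 2, 0],
     [0, 0, 0, 0]]
M1 = dyadic_maximal(F, 1.0)
M2 = dyadic_maximal(F, 2.0)
```

On this sample one can check pointwise that `M1` dominates `|F|` and that `M2` dominates `M1`, in line with the monotonicity of normalized $L^p$ norms used in the proof.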



5. An example that illustrates the proof

In this short section we show how important steps of the proof should be performed on a concrete example given in Figure 6.







Figure 6. An example of a bipartite graph and its splitting. 

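The example has parameters

$$m = 7,\quad n = 6,\quad E = \{(1,1), (1,2), (1,3), (2,1), (2,2), (3,4), (4,4), (4,5), (5,6)\},$$

$$S = \emptyset,\quad T = \{1,4\},\quad d_{1,1} = d_{1,2} = d_{1,3} = d_{2,1} = d_{2,2} = 3,\quad d_{3,4} = d_{4,4} = d_{4,5} = 2,\quad d_{5,6} = 1;$$

the splitting into connected components is

$$\mathcal{X}_1 = \{1,2\},\ \mathcal{Y}_1 = \{1,2,3\},\quad \mathcal{X}_2 = \{3,4\},\ \mathcal{Y}_2 = \{4,5\},\quad \mathcal{X}_3 = \{5\},\ \mathcal{Y}_3 = \{6\};$$

and the forms we associate to this graph decompose as

$$\Lambda_{\mathcal{T}} = \sum_{Q\in\mathcal{T}} |Q|\,|\mathcal{A}^{(1)}_Q|\,|\mathcal{A}^{(2)}_Q|\,|\mathcal{A}^{(3)}_Q|,$$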



where 



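$$\mathcal{A}^{(1)} = \Big[\big\langle F_{1,1}(x_1,y_1)F_{1,2}(x_1,y_2)F_{1,3}(x_1,y_3)F_{2,1}(x_2,y_1)F_{2,2}(x_2,y_2)\big\rangle_{y_1}\Big]_{x_1,x_2,y_2,y_3},$$

$$\mathcal{A}^{(2)} = \Big[\big\langle F_{3,4}(x_3,y_4)F_{4,4}(x_4,y_4)F_{4,5}(x_4,y_5)\big\rangle_{y_4}\Big]_{x_3,x_4,y_5},$$

$$\mathcal{A}^{(3)} = \big[F_{5,6}(x_5,y_6)\big]_{x_5,y_6}.$$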
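The component splitting and the numbers $d_{i,j}$ listed above can be recomputed mechanically from the edge set. The following is a minimal sketch, assuming the definition $d_{i,j} = \max\{|X_l|, |Y_l|\}$ from Subsection 3.2; the helper `components` is a hypothetical union-find routine, not something taken from the paper.

```python
from collections import defaultdict


def components(edges):
    """Group the edges of a bipartite graph into connected components.

    Returns a list of (X, Y) pairs of index sets, one pair per component
    that contains at least one edge, ordered by the smallest x-index.
    """
    parent = {}

    def find(v):
        while parent[v] != v:
            parent[v] = parent[parent[v]]  # path halving
            v = parent[v]
        return v

    for i, j in edges:
        for v in (("x", i), ("y", j)):
            parent.setdefault(v, v)
        parent[find(("x", i))] = find(("y", j))

    groups = defaultdict(lambda: (set(), set()))
    for v in parent:
        side, idx = v
        groups[find(v)][0 if side == "x" else 1].add(idx)
    return sorted(groups.values(), key=lambda xy: min(xy[0]))


E = [(1, 1), (1, 2), (1, 3), (2, 1), (2, 2), (3, 4), (4, 4), (4, 5), (5, 6)]
comps = components(E)

# d[(i, j)] = max(|X_l|, |Y_l|) for the component containing the edge (i, j)
d = {}
for X, Y in comps:
    for i, j in E:
        if i in X and j in Y:
            d[(i, j)] = max(len(X), len(Y))

print(comps)
print(d)
```

Vertices that do not occur in any edge are isolated and are simply absent from the output, in line with the convention that only components containing an edge are listed.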



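In order to deal with a more symmetric situation, we "complete" the graph by introducing $F_{2,3} \equiv 1$ and $F_{3,5} \equiv 1$. Since $|T| = 2$, we decide to rewrite

$$\mathcal{A}^{(1)} = \Big[\big\langle F_{1,1}(x_1,y_1)F_{2,1}(x_2,y_1)\big\rangle_{y_1} \big[F_{1,2}(x_1,y_2)F_{2,2}(x_2,y_2)\big]_{y_2} \big[F_{1,3}(x_1,y_3)F_{2,3}(x_2,y_3)\big]_{y_3}\Big]_{x_1,x_2},$$

$$\mathcal{A}^{(2)} = \Big[\big\langle F_{3,4}(x_3,y_4)F_{4,4}(x_4,y_4)\big\rangle_{y_4} \big[F_{3,5}(x_3,y_5)F_{4,5}(x_4,y_5)\big]_{y_5}\Big]_{x_3,x_4}$$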
and start by estimating the general term as in Subsection 3.2, Case 2,

$$|\mathcal{A}| = |\mathcal{A}^{(1)}|\,|\mathcal{A}^{(2)}|\,|\mathcal{A}^{(3)}| \le \tfrac{1}{2}(\mathcal{A}^{(1)})^2 + \tfrac{1}{2}(\mathcal{A}^{(2)})^2$$



< 



Fi,i(x 1 ,y 1 )F 2<1 (x 2 , yi)) 2 yi [Fi,3{xi, 2/3)^2,3(2:2, 2/3)]^ 



AW 



[-^1,2(2:1, 2/2)^2,2(2:2, 2/2)]^ [^1,3(2:1, 2/3)^2,3(2:2, 2/3)] w 



(^3,4(^3,2/4)^4,4(2:4,2/4) 



V4 



<1 



23,24 



[^3,5(«3, 2/5)-Kl,5(2:4, 2/5)] 



23,24 



<1 



Let us proceed with $\mathcal{A}^{(4)}$, since $\mathcal{A}^{(5)}$ is in the third row of Table 1 and can be treated even more easily. Following the approach from the proof of Lemma 3.3, Case 2, we define



0(4) 



[Fi,i(xi, 2/1)^2,1(2:2, 2/1)] ^ [^1,3(2:1, 2/3)^2,3(2:2, 2/3)] 



21, 2 2 



[-Pl,l(^l, 2/l)^l,l(2:i, ^1)^1,3(2:1, 2/3)] ^[-^2,1 (^2,2/1)^2,1(2:2,2/1)^2,3(2:2,2/3)]^ 

By Theorem 2.1 every term in $\Box\mathcal{B}^{(4)}$ is either $\mathcal{A}^{(4)}$, or

A {6) = (F hl (xi,y 1 )F 2il (x 2 ,yi)) yi [F 1 ^(x 1 ,y 1 )F 2A (x 2 ,y 1 )] yi (F li3 (x 1 ,y 3 )F 2 ^ 3 (x 2 ,y 3 )) y3 
or is of the form 

A {7) = (<Fi il (xi,yi)Fi i i(xi,yi)Fi i3 (xi,y 3 )> :Cl <i 71 2,i(2:2, 2/1)^2,1(^2, 2/1)^2,3(2:2,2/3)} 



£1,2:2 



where parentheses $(\cdot)$ are understood as either $[\cdot]$ or $\langle\cdot\rangle$ for each of the variables $y_1, y_1', y_3$ independently. We dominate $\mathcal{A}^{(6)}$, $\mathcal{A}^{(7)}$ as in the proof of Lemma 3.3, Case 1.



I^ (6) l < y 5 [< F i,i(^ ^2,1(2:2,2/1)}^ [^1,1(2:1,2/1)^2,1(2:2,2/1)] 



Xl,X2 



AW 



+ 2 



(^1,3(2:1,2/3)^2,3(2:2,2/3)}^ [-^1,1(2:1,2/1)^2,1(2:2,2/1)] 



1 .VI 



AW 



\A^\ < 



2 
1 

+ 2 



[Fi,i (2:1 , 2/1 )-Fi,i (xi , y[ )Fi )3 (xi , y 3 ) }^ 



,4(10) 



(-^2,1(2:2, 2/1)^2,1(2:2, 2/1)^2,3(2:2, 2/3))^ 



AW 



Since $\mathcal{A}^{(9)}$ is analogous to $\mathcal{A}^{(4)}$, the algorithm loops (with a small "weight" $\delta$) and we continue with $\mathcal{A}^{(8)}$ only. Also, by symmetry, we can consider $\mathcal{A}^{(10)}$ and disregard $\mathcal{A}^{(11)}$.




To control $\mathcal{A}^{(8)}$ we define



2- r , 



0(8) 



[F 1)1 (x 1 ,yi)F 2) i{x2,yi)] 



f/i 



[Fi,i( x i,yi)Fi,i(xi,y[)Fi,i(xi,yi)] Xl [F2,i{x 2 ,y l )F 2il (x 2 ,y[)F 2tl (x 2 ,y'{)] 
Every term in $\Box\mathcal{B}^{(8)}$ different from $\mathcal{A}^{(8)}$ is of the shape

A {12) = (^(Fi t i(xi,yi)Fi i i(xi,y' 1 )Fi i i(xi,yi)) x ^ (F 2i i(x2, yi)F 2i i(x2, yi)F 2i i(x2, y") 
and is estimated as 



J-2 



X2 



yi,y[,y" 



1 



|-4 (12) |< - <^,i(xi ) yi)F ljl (x 1) yi)^i,i(^>yi , )>i 



2 L 
1 

+ 2 



yi>y{,y" 



,4(13) 



(-^2,1(^2, yi)F 2 ,i(x 2 , y[)F 2 ,i (x 2 , y")) 2 X2 
Finally, $\mathcal{A}^{(13)}$ corresponds to a partition with the smallest rank. We introduce



vi,y'i,y" 



B (13) 



!/i>yi>j/i 



[^1,1(^1) 2/1)^1,1(^1) yi)] 



in. 



and observe that by Theorem 2.1 each term in $\Box\mathcal{B}^{(13)}$ different from $\mathcal{A}^{(13)}$ is of the form

[F 1A (x 1 , y^F^ix'^ yy)) 2 yi [F 1A (x 1 , y^F^fa, y x ) 



yi 



XI, x ± 



The sum of these terms is nonnegative by Lemma 2.2 and can be discarded as it only increases $\Box\mathcal{B}^{(13)}$.

On the other hand, to deal with $\mathcal{A}^{(10)}$ we define



yuy v y3 



B {10) = [[F^ixt^Ft^xuy^F^xuya)] 2 ^ 

= [-Pi,i(^l,yi)^i,i(4>yi)]^ [ F i,3(x 1 ,y 3 )F lj3 (x[,y 3 )] y:} 
Every term in $\Box\mathcal{B}^{(10)}$ other than $\mathcal{A}^{(10)}$ takes either the shape



Xl ,x\ 



Fx,i{x 1 ,y 1 )F lil {x' 1 ,y 1 )) 2 yx [F h3 (x 1 ,y 3 ,)F h3 (x' 1 ,y 3 )] 



!/■>, 



xi ,x\ 



or the shape 

(F ltl (x 1 ,y 1 )F 1 , 1 (x' 1 ,y 1 )) yi [Fi t i(xi,yi)Fi tl (x[,yi)] yi {F lj3 (xi,y 3 )F lt3 (x[,y 3 )) y3 



X\ ,x^ 



The sum of the former terms is nonnegative by Lemma 2.2, while the latter ones are treated similarly to $\mathcal{A}^{(6)}$.

A concrete Bellman function $\mathcal{B}$ satisfying $|\mathcal{A}| \le \Box\mathcal{B}$ and (3.5) under Normalization (3.2) can be assembled as a linear combination of all averaging paraproduct-type terms that appear in the proof, including the omitted ones. Estimate (2.5) gives (3.3) once again.
6. Several remarks

6.1. Comments on the exponent range. Already the twisted paraproduct shows that the range of exponents in (1.3) is not optimal. Our restriction of the range comes from the fact that the proof of (3.1) for a single connected component gradually reduces the term $\mathcal{A}$ to paraproduct-type terms with "more and more" repeating functions. In the worst case we end up with only one function, say $F_{1,1}$, appearing $mn$ times. This means that the byproduct of such a proof is



E \ Q \ Sn,« \Qt\ ( max [i^J 

f-t -> v ' V QeT U£(T) 



mn 



where 



A IXJ (F 1A ,...,F 1A )= [( U F ^ x ^) Xi 



€1 for ieS x t el for ies c 
l<i<m ' Vi e J for j ST Vj G J for j eT c 

l<J<n 



and $d = d_{1,1}$. Taking $\mathcal{T}$ to be a single square $\{[0,2)^2\}$ and $F_{1,1}$ to be a nonnegative elementary tensor $F_{1,1}(x,y) = f(x)g(y)$ supported on $[0,1)^2$ we conclude



U J [0,1) l J J [0,1) ~ m - n ^ l J J [0,1) \P J [0,1 



mn 



which is only possible if d > max{m, n}. 

However, the general method based on Section 2 does not share those limitations and one can possibly find a Bellman function $\mathcal{B}$ that proves (3.1) with smaller exponents $d_{i,j}$. In some particular cases the procedure of reducing the general term to the "simpler" ones can be employed recursively rather than inductively, thus avoiding the need to deal with unnecessary paraproduct-type terms.

As an example consider a cycle of length $4n$, i.e.

$$E = \{(1,1), (1,2), (2,2), (2,3), (3,3), (3,4), \ldots, (2n,2n), (2n,1)\}$$

and suppose $1, 2 \in S$. We can estimate the general term by separating brackets $[\cdot]_{x_i}$ or $\langle\cdot\rangle_{x_i}$ for odd and even $i$:

+ - [(F 2i2 (x 2 ,y 2 )F 2i 3(x 2 ,y 3 )) 2 J y2jS/3 [[F4 i 4(x4, ? /4)F4 i5 (x4,y5)]^] y4iS/5 

This gives (3.1) with $d_{i,j} = 2$ and consequently also (1.3) in the range $2 < p_{i,j} < \infty$, rather than $2n < p_{i,j} < \infty$.
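The combinatorics behind the comparison with $2n$ can be checked directly. The sketch below, with hypothetical helper names, builds the edge set of the cycle exactly as listed above and confirms that it has $4n$ edges and a single connected component with $2n$ vertices on each side, so the general definition $d_{i,j} = \max\{|X_l|, |Y_l|\}$ from Subsection 3.2 gives $2n$ for this graph.

```python
def cycle_edges(n):
    """Edges (i, j) of the cycle: x_i is joined to y_i and to y_{i+1} (indices mod 2n)."""
    size = 2 * n
    return [(i, j) for i in range(1, size + 1) for j in (i, i % size + 1)]


def component_of(edges, start_x):
    """Index sets (X, Y) of the connected component containing x_{start_x}."""
    xs, ys = {start_x}, set()
    changed = True
    while changed:
        changed = False
        for i, j in edges:
            # exactly one endpoint already reached: pull in the other one
            if (i in xs) != (j in ys):
                xs.add(i)
                ys.add(j)
                changed = True
    return xs, ys


n = 3
E = cycle_edges(n)
X, Y = component_of(E, 1)
d_general = max(len(X), len(Y))
print(len(E), d_general)
```

This only reproduces the value $2n$ coming from the general definition; the improvement to $d_{i,j} = 2$ relies on the direct estimate described in the text and is not something the code verifies.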

Determining the full range of exponents $(p_{i,j})_{(i,j)\in E}$ for which (1.3) holds remains an open problem. The only counterexamples we know are at the boundary of the Banach simplex, see [6]. Certain extensions are possible in the class of mixed-norm $L^p$ spaces, but they still face similar restrictions.
6.2. A remark on non-bipartite graphs. We close this section by commenting on our setup. Even though it is possible to associate multilinear forms to general undirected graphs, the method cannot be applied to forms arising from graphs that are not bipartite. For instance, a multilinear form associated to a triangle, i.e. a cycle of length 3, could be

A A (F, G, H) = I 7 !' 7 ' / F ( x ' v) G (y> Z W Z > x ) V^Wj (y)if&(z) dxdydz, 

\I\ = \J\ = \K\ 
c(I,J,K)=0 

where $c(I, J, K) = 0$ is some constraint, making the sum effectively indexed by only two of the intervals. Such forms seem to share many characteristics with the two-dimensional "triangular" Hilbert transform,

$$\Lambda_{\triangle\mathrm{BHT}}(F, G, H) = \int_{\mathbb{R}^2} \Big(\mathrm{p.v.}\int_{\mathbb{R}} F(x - t, y)\, G(x, y - t)\,\frac{dt}{t}\Big) H(x,y)\,dx\,dy,$$

arising from "singular bilinear averages" introduced in [5] and for which no $L^p$ bounds are known at the time of writing.